Detecting directional couplings from time series is crucial in understanding complex dynamical systems. Various approaches based on reconstructed state-spaces have been developed for this purpose, including a cross-distance vector measure, which we introduced in our recent work. Here, we devise two new cross-vector measures that utilize ranks and time series estimates instead of distances. We analyze various deterministic and stochastic dynamics to compare our cross-vector approach against some established state-space-based approaches. We demonstrate that all three cross-vector measures can identify the correct coupling direction for a broader range of couplings for all considered dynamics. Among the three cross-vector measures, the rank-based variant performs the best. Comparing this novel measure to an established rank-based measure confirms that it is more noise-robust and less affected by linear cross-correlation. To extend this comparison to real-world signals, we combine both measures with the method of surrogates and analyze a database of electroencephalographic (EEG) recordings from epilepsy patients. This database contains signals from brain areas where the patients’ seizures were detected first and signals from brain areas that were not involved in the seizure onset. A better discrimination between these signal classes is obtained by the cross-rank vector measure. Additionally, this measure proves to be robust to non-stationarity, as its results remain nearly unchanged when the analysis is repeated for the subset of EEG signals that were identified as stationary in previous work. These findings suggest that the cross-vector approach can serve as a valuable tool for researchers analyzing complex time series and for clinical applications.
Interacting dynamics are ubiquitous in natural and man-made systems. A basic example is a driver-response dynamics, where the state of the driver influences the temporal evolution of the response, but the response has no impact on the driver. A fundamental problem in time series analysis is characterizing such directional couplings based on the signals measured from the two dynamics. Here, it is essential to not only detect the presence and quantify the strength of the coupling but also to determine its direction. Only in this way can one identify the driving dynamics. A broad variety of approaches have been developed for this purpose. These include families of methods based on Granger causality,1–5 information theory,6–10 instantaneous phases,11–14 and reconstructed state-spaces,15–24 among others. These different families are complementary with regard to the type of time series and dynamics they are best suited for. Within each family, on the other hand, one should strive to find the most powerful method. In this paper, we consider the family of state-space-based methods. We show that by fusing two established methods, one can reach an improved accuracy in characterizing directional couplings in driver-response dynamics. First, we illustrate this advantage under controlled conditions using mathematical model systems and then provide an application to electroencephalographic (EEG) recordings from epilepsy patients.
I. INTRODUCTION
Consider two stationary dynamical systems and that exhibit a self-sustained motion in the absence of a coupling between them. Assume that if there is a coupling, it is unidirectional from to . This means that the state of the driver influences the motion of the response , but has no influence on . The aim of this work is to further improve techniques that allow for the characterization of such driver-response dynamics from the analysis of pairs of time series. Here, one time series is measured from the driver, and the other is measured from the response. Such techniques must be sensitive to and specific for couplings, i.e., they should detect a coupling if and only if it is actually present. In particular, it is essential to detect the correct coupling direction for a broad range of non-zero coupling strengths. We, therefore, use the following notions. Given a coupling , for a technique to be direction-sensitive, it should detect a coupling in this direction. Moreover, to be direction-specific, the technique should not falsely detect a coupling .
Čenys et al.15 gave the key to achieving these goals based on state-space reconstructions from the time series of and . They pointed out that if the states that the response dynamics attains at two times are similar, the states of the driver dynamics at these times are likely also to be similar. The strength of this conditioning increases with increased coupling strength. The conditioning in the other direction, i.e., similarity of states in implying similarity of states in , holds to a significantly lesser degree. This principle is utilized by numerous state-space-based approaches that aim at characterizing the driver-response dynamics.15–24 We here follow Ref. 25 and refer to it as the asymmetric state similarity conditioning principle.
Importantly, the way in which the different state-space-based approaches quantify the asymmetric state similarity conditioning can have a decisive impact on their direction-sensitivity and direction-specificity (see, for example, Refs. 20, 22, and 24). Aiming to optimize this performance, we recently introduced the approach of cross-distance vectors and defined a novel measure, denoted here by , for the detection of directional couplings from the prominence of the initial tails of these vectors.24 We compared this cross-distance vector measure against a state-space-based approach that was introduced and referred to as the nonlinear interdependence measure in Ref. 21. This comparison provided evidence for the advantages of over , including a better direction-specificity.24 However, for both and , the similarity between states is quantified by distances between the corresponding points in reconstructed state-spaces of and . This use of distances can lead to bias caused by asymmetries between and that are unrelated to couplings, such as different effective dimensions of these individual dynamics.22 To diminish this bias, Ref. 22 introduced the nonlinear interdependence measure , which uses ranks of distances instead of the values of the distances. A further very prominent state-space-based approach to characterize coupled dynamics is the convergent cross mapping measure ,23 in which distances are used to construct weights in cross prediction estimates between and . Therefore, the following question arises. Can the performance of the cross-vector approach be improved by using vectors of ranks or vectors of cross prediction estimates instead of vectors of distances? To address this question, we here introduce the cross-rank vector measure and the cross-estimate vector measure . We compare these novel measures against the cross-distance vector measure , the nonlinear interdependence measures and , and the convergent cross mapping measure .
In Sec. II, we start by reviewing the definitions of the previously established measures, ,21, ,22, ,23 and .24 We then introduce the definitions of and . In Sec. III A, we apply all six measures to noise-free unidirectionally coupled nonlinear deterministic dynamics. Since this application shows the advantages of the rank-based measures and over the other measures, we restrict the remaining analysis to these two measures. In Sec. III B, we compare the noise robustness of and . Section III C explores the performance of these measures on short time series. Section III D provides an application to stochastic dynamics and illustrates the effect of linear cross correlation. To compare the measures’ performance on real-world data, an application to electroencephalographic (EEG) recordings from epilepsy patients26 is given in Sec. IV. For this part of our analysis, we combine and with the method of surrogates.21,27,28
II. STATE-SPACE APPROACHES
Now, we introduce the central concept of the th conditioned nearest neighbor of , defined as . If there is coupling from to , due to the asymmetric state similarity conditioning principle, the conditioned nearest neighbors of are commonly also its true neighbors in the sense that they are close in space to . In contrast, in the absence of this coupling, the conditioned neighbors of are typically not close in space to . Keep in mind that we have the pruned distance matrices of size that take into account the Theiler correction and our additional constraint. Therefore, for all points, we have the same number of candidates for both the true nearest neighbors and conditioned nearest neighbors.
All six state-space-based measures used in this work share the initial steps described up to this point. Based on these definitions, they quantify the closeness of the conditioned nearest neighbors in different ways. However, the four measures , , , and also share the subsequent steps. The permutations and are used to construct the following ordered matrices. At first, the permutation is applied to the th row of across resulting in the matrix , where is the distance from to its th nearest neighbor. Then, in analogy, the permutation is applied to the th row of across resulting in the cross-distance matrix , where is the distance from to its th conditioned nearest neighbor. Analogous procedures are used to determine and .
A. Nonlinear interdependence measures M, L
B. Convergent cross mapping measure ρ
All three measures defined thus far reach the maximal value of 1 when the two time series are identical. In the other limiting case of independent dynamics, their values are distributed around 0.
C. Cross-distance vector measure
D. Novel cross-rank vector and cross-estimate vector measures and
Reference 24 reported a substantially better direction-specificity of compared to the distance-based nonlinear interdependence measure . In this paper, we expand the idea of quantifying the vector tails to ranks [analogous to , Eq. (3)] and to time series estimates [analogous to , Eq. (6)]. In what follows, we accordingly introduce two new cross-vectors, namely, the cross-rank vector and the cross-estimate vector.
Let us briefly address the differences between the three variants of the cross-vectors. Despite that the ordinates in the different panels of Fig. 1 have different orders of magnitude, all three variants of vectors result in very similar shapes. However, the relative length of the initial tail and, therefore, the values of the obtained measures , and differ substantially. This difference will be thoroughly discussed in Sec. III A.
III. DIRECTION-SENSITIVITY AND DIRECTION-SPECIFICITY
In this section, we compare the direction-sensitivity and direction-specificity of the nonlinear interdependence measures and , the convergent cross mapping measure , and the three cross-vector approaches. We first analyze noise-free deterministic dynamics in Sec. III A, then study the impact of noise in Sec. III B, explore the performance on short time series in Sec. III C, and, finally, consider stochastic dynamics and the effects of linear cross correlation in Sec III D.
A. Noise-free deterministic systems
We begin by analyzing the dependence of the six measures on the coupling strength between noise-free deterministic dynamics. Recall that each measure has two complementary variants: one to detect couplings from to and the other to detect couplings from to . We analyze three pairs of deterministic chaotic dynamical systems with a mismatch in their parameters, unidirectionally coupled in the direction with diffusive coupling, with strength :
- Rössler systems:
- Lorenz systems:
- Duffing systems:
The exclusion window is samples, the , and measures’ nearest neighbor parameter from Eqs. (2)–(5) is , and the cross-vector measures’ parameters from Eqs. (8), (10), and (12) are . We choose , which allows for an equitable comparison across all six measures, as and both determine the number of nearest neighbors. Following the previous work,24 we set . This previous work showed that setting a too high value of generally decreases the direction-specificity of , and setting to the maximal possible value of makes perform almost identically to . Accordingly, an adequate choice of the parameter is key for good direction-specificity. For simplicity, we use the embedding parameters . As long as is sufficiently large, the product matters more than both individual parameter values. This is because this product determines the temporal distance between the first and the last entry in the embedding vectors [Eq. (1)]. All parameters are kept the same for the analysis of all three deterministic dynamics.
For each system, we vary the coupling strength equidistantly on a logarithmic scale, from a negligibly small value up to and above the threshold that leads to generalized synchronization33 in different dynamics. We, furthermore, include the uncoupled case, i.e., . For each value of , we compute the distribution of the measures’ values across realizations of the dynamics. The six measures are always calculated for the same set of realizations so that the randomness of the initial conditions has no impact on the comparison across measures.
The coupling direction in our systems is . Accordingly, to be direction-sensitive, a measure should detect a coupling . Moreover, to be direction-specific, the measure should not detect a coupling . We assess direction-sensitivity by the minimal coupling strength value , above which the distribution of values of the measure’s variant used to detect a coupling differs significantly from the distribution obtained for the uncoupled case. A measure is judged as more direction-sensitive if it has a smaller . Likewise, we assess direction-specificity by the minimal value of , above which the distribution of values of the measure’s variant used to detect a coupling differs significantly from the distribution at . Since the actual coupling direction is , a measure is more direction-specific if it has a larger . We use the Kolmogorov–Smirnov test at a significance level to assess whether two distributions differ. We determine the values of and as the first of at least five consecutive values of that result in rejections of the Kolmogorov–Smirnov test. The results of the analysis are shown in Fig. 2.
For all measures, the variants used to detect couplings increase with increasing coupling strength and are larger than the complementary variants used to detect couplings across wide ranges of coupling strength. This is to be expected of any directional coupling measure. The values are comparable among five of the six measures. The exception is with slightly smaller values of , which indicates that it is the most direction-sensitive measure. However, it also has the largest variance of the distributions of its values across different realizations, as indicated by broad colored bands in Figs. 2(g)–2(i). This can undermine a better direction-sensitivity in practice as, often, only a small number of realizations of time series pairs is available, and a higher variability of can make the estimation of this measure’s mean value less reliable. The other five measures have not only similar direction-sensitivity but also similar variance across realizations, making them comparable in this aspect.
The value of any of the measures , , or is larger than the value of any of the measures , , or . In most cases, is around ten times larger for the cross-vector measures than for the respective classical counterparts. This means that the measures are substantially more direction-specific and can be used to reliably infer the coupling direction for a broader range of coupling strengths. The most direction-specific out of all six measures are and . Next is , and then , and the least direction-specific measures are and .
It is well-known that the coupling direction is generally not well-defined for synchronized dynamics since we no longer have two separate dynamics. Indeed, the values of measures and are closely below the generalized synchronization threshold. When the coupling strength is further increased above the threshold, all measures lose the ability to infer the true coupling direction.
Recall that we use coupled non-identical chaotic systems that enter into generalized synchronization. Although they cannot show complete synchronization, the resulting two time series become almost identical at very large , and four out of six measures reach values close to 1 in all three dynamics. Conversely, the measure remains considerably below 1 in all the dynamics, and does so for the Duffing dynamics. Furthermore, the values reached by and differ between the three dynamics, which necessitates further interpretation.
Consider computing the measures , , and from two identical time series and . The values of the components of cannot be smaller than the smallest entry of the distance matrix . Additionally, a time series estimate obtained from the th conditioned nearest neighbors cannot perfectly replicate the original time series . This is because even the closest neighbor to a point is never identical to the point itself. Consequently, the similarity of the estimate to the original time series is limited by the closeness of the nearest neighbors. This example shows that the measures and are influenced by the distributions of values in the distance matrices. Therefore, comparing and (or and ) to determine the coupling direction is less equitable when the distributions of values in and are different.
Conversely, the measure does not depend on the distributions of distances. It is derived from ranks, which are uniformly distributed regardless of the pair of time series being analyzed. The entries of the cross-rank vector obtained from two identical time series are , resulting in . This holds for all pairs of identical time series. The measure is, therefore, not influenced by the distributions of values in the distance matrices. Accordingly, it can determine the coupling direction more accurately than and .
In conclusion, the measures and have comparable direction-sensitivity and direction-specificity, while is slightly less powerful in these aspects. Due to the additional advantage of uniform distribution of ranks, we consider the best cross-vector measure. Therefore, we use and its classical counterpart in the subsequent analysis.
B. Noise robustness in deterministic systems
It is important to test if a measure stays reliable when the data contain noise, as the measured data often do. To assess the noise robustness of and , we add measurement noise to the three deterministic systems from Sec. III A. Specifically, we add Gaussian white noise with zero mean and increasing standard deviation . We use noise levels equally spaced on a logarithmic scale starting from , where is the standard deviation of the noise-free time series. For each noise level, we determine the values and of the measures and . The results are shown in Fig. 3, which includes the results for the noise-free dynamics already shown in Fig. 2 for comparison.
As the noise level increases, the value of both measures generally increases for all three systems. The values of and are nearly identical for noise-free data. However, with increasing noise, increases faster for than for for all three systems. This shows that retains better direction-sensitivity than when noise is introduced. On the other hand, when the noise level is large, the values become very similar to . This happens at larger for than for , showing that retains better direction-specificity. Accordingly, allows for correct inference of the coupling direction across a substantially wider range of coupling strength values at all noise levels. Thus, exhibits better overall noise robustness than .
C. Influence of time series length
In real-world applications, the measured time series are often short. This section, therefore, presents the analysis of the three dynamics from Sec. III A for different time series lengths . The time series length is varied linearly on a logarithmic scale. The minimal value is , which corresponds to 6–10 oscillations in these different dynamics. As the maximal value, we use , which is the time series length taken in Secs. III A and III B. The embedding parameters and , as well as the Theiler correction window , remain unchanged. Note that at the shortest , this leaves only candidates for the nearest neighbors in total. Accordingly, we deliberately include a very short time series in this analysis, for which hardly any detection of the coupling can be expected.
For each time series length, we determine the values and of the measures and . Here, care must be taken in choosing the parameters and from Eq. (10), and from Eq. (3). Recall that these parameters determine the number of nearest neighbors. If a time series is half as long, a chosen point will, on average, have half as many neighbors in a neighborhood of a certain size. Using the same number of nearest neighbors would result in using more distant neighbors. Accordingly, if the time series are half as long, the parameters , , and should be half as large in order for the results to be comparable. Therefore, we adjust the parameters and such that the ratios and are kept constant. However, cannot be smaller than , and cannot be smaller than , which ensures the ratio is kept constant for all .
The results are shown in Figs. 4(a)–4(c). First, we observe that the values of of both and decrease with increasing . This is expected, as generally longer time series lead to an increase in detection power. The values of are generally comparable for both measures, indicating that the measures’ direction-sensitivity remains similar between them in short time series. The exception to this are coupled Rössler systems, where has better direction-sensitivity for very short time series. Conversely, continues to be more direction-specific than for all and all three dynamics, as its values of remain substantially larger than those of . An interesting observation is that of decreases with increasing . At first sight, it seems contradictory that the direction-specificity of this measure deteriorates with increasing time series length. It can, however, be understood straightforwardly by the following considerations. Like we have outlined above, the inverse of the asymmetric state similarity conditioning principle (closeness in the driver maps to closeness in the response ) holds to a much weaker degree than the principle itself (closeness in the response maps to closeness in the driver ). Although it is weaker, the inverse conditioning still holds and is quantified by . The quantity is the coupling at which this inverse conditioning is detected by . Accordingly, the deterioration of ’s direction-specificity with increasing time series length is explained by an increased sensitivity of for the inverse conditioning, causing the incorrect detection of a coupling from the response to the driver. Conversely, the values of remain close to the generalized synchronization threshold for all . This shows that is much less affected by this inverse conditioning, further explaining its better direction-specificity.
To illustrate the importance of adequately setting the nearest neighbor parameters, Figs. 4(d)–4(f) show the results obtained for fixed values and .36 We can see that direction-sensitivity is comparable to the results for the adjusted parameters seen in Figs. 4(a)–4(c). However, the direction-specificity values of are much smaller when the parameters are not adjusted to the changing . This shows the negative effect of a poor choice of parameters on direction-specificity.
D. Stochastic dynamics and the effect of linear cross correlation
We have shown that the measures and allow one to detect directional couplings in nonlinear deterministic dynamics. However, it is well-known that coupling detection measures can also be influenced by other time series properties. In particular, a strong linear cross-correlation between the time series and can result in high values of the measures. The problem here is that even in the absence of coupling, one can obtain an arbitrary degree of linear cross-correlation, for example, by mixing the time series originating from independent dynamics. Furthermore, it is also important to assess how these measures behave in stochastic systems, as randomness can potentially affect the reliability of the measures. To assess the effects of linear cross-correlation and randomness on our measures, we analyze linear and nonlinear stochastic processes.
For the two systems (17) and (18), we vary and , respectively, on a linear scale and generate realizations for each value. One realization results in a pair of time series, each consisting of samples. As embedding parameters, we use and . The exclusion window is , the measure’s parameter from Eq. (3) is , and the measure’s parameters from Eq. (10) are for all cases.
For system (17), we evaluate the direction-specificity by determining the minimal at which the results differ significantly from those at . We use the same statistics and criteria as for determining , resulting in the analogous quantity . We can, however, not determine any quantity analogous to since system (17) has no coupling and we cannot evaluate the measures’ direction-sensitivity.
Figure 5(a) shows the results of the analysis of system (17). The two time series and are always generated by a mixing of independent dynamics (16). In the limit of , the time series and remain independent. In consequence, the mean values of the directional coupling measures are very close to zero. As increases and with it the cross-correlation, both and also increase. This illustrates the previously mentioned impact of the linear cross-correlation on coupling detection measures. This can be regarded as a limited specificity of these measures since arbitrary degrees of cross-correlation can be obtained by mixing signals from uncoupled dynamics. At complete mixing, i.e., , the time series and are identical, and accordingly, both measures reach their respective maximal values. Figure 5(a) also shows that is less affected by linear cross-correlation than . First, the value of is bigger for than for . This shows that for , a stronger cross-correlation is needed to cause a significant difference between results with and without a cross-correlation. The second aspect is mean values of the measures obtained across multiple realizations of the dynamics for a given value of . Upon increasing , the curve of stays close to zero longer and increases slower than the one of . This is of particular relevance for the application to real-world dynamics where often there is no controllable parameter like , and the mean value of the measures across realizations for a fixed setting is all that is available.
Figure 5(b) shows the results of the analysis of system (18). Recall that the coupling direction is . The mean values of and both increase with increasing and are substantially larger than the mean values of and , respectively. This shows that both and can be used to determine the coupling direction in stochastic systems. Comparing the values of reveals that is more direction-sensitive in this scenario. However, is once again found to be more direction-specific than , as indicated by the mean values of increasing above zero, while the mean values of remain very close to zero. Furthermore, the value of is only slightly larger than its , which means that detects the correct direction for only a narrow range of coupling strengths. In contrast, the values obtained for are significantly different from those at only at two non-consecutive coupling strength values, so that remains undefined. Accordingly, also in this setting, exhibits strong direction-specificity.
Finally, an important observation can be made by analyzing the cross-correlation values in Fig. 5(b). The mean cross-correlation between and stays close to zero regardless of the value of . However, still increases with increased , even though the coupling direction is . This shows that strong couplings from to can cause an increase of the measure , which is supposed to detect a coupling from to , even without a linear cross-correlation. This lack of direction-specificity in the absence of linear cross-correlation is not found for .
IV. APPLICATION TO EEG RECORDINGS
In this section, we apply the rank-based measures and to electroencephalographic (EEG) recordings from epilepsy patients to assess the measures’ applicability to real-world data. We use the publicly available Bern–Barcelona EEG database.26 In the initial analysis of this database, the measure was combined with bivariate surrogates.26 We here extend this analysis by including not only but also . We, therefore, at first, briefly describe the database in Sec. IV A, outline the concept of surrogates in Sec. IV B, and describe details of our EEG analysis and results in Sec. IV C.
A. The Bern–Barcelona EEG database
The database consists of 7500 pairs of signals extracted from intracranial EEG recordings from five epilepsy patients. Each signal pair was recorded simultaneously at the neighboring EEG channels and has a length of s at a sampling frequency of Hz. Half of the pairs were recorded from brain areas where the first seizure-related EEG signal changes were detected for individual patients (focal signal pairs). The other half were recorded from brain areas that were not involved in patients’ seizures at their onset (nonfocal signal pairs). Focal and nonfocal signal pairs were picked randomly across patients, time, and channel pairs within the respective brain areas. The location of the seizure onset zone determines whether a given signal pair is focal or nonfocal. However, all signals were taken from recordings of the seizure-free time interval. In other words, neither the 3750 focal nor the 3750 nonfocal signal pairs include actual seizure activity. An exemplary pair of focal signals is shown in Fig. 6(a). For a more detailed description of the signal acquisition, composition, and preprocessing, the reader is referred to the paper.26
B. Surrogate signals
In Sec. III, we use model systems under controlled conditions. This allows us to directly obtain null distributions of the measures expected for uncoupled and independent dynamics. For the deterministic dynamics and the nonlinear stochastic dynamics, we generate uncoupled dynamics by setting in their respective Eqs. (13), (14), (15), and (18). Analogously, for the linear mixing of independent autoregressive processes, we generate the non-mixed system by setting in Eq. (17). Our analysis in Sec. III is then based on statistical comparisons of the results obtained for these null distributions against results obtained for nonzero couplings ( ) and nonzero mixing ( ).
When dealing with experimental data, it is typically not possible to generate signals for uncoupled and independent dynamics. This problem can be addressed by the concept of surrogate data.21,27,28 Surrogates are a very powerful and flexible tool in signal analysis. They allow estimating the null distribution of measures expected under well-defined null hypotheses about the dynamics underlying some measured data. For this purpose, a constrained randomization is applied to the original data, resulting in one realization of the surrogate data. In particular, the randomization constraints are defined such that the surrogate data preserve all properties of the original data that are consistent with the null hypothesis but are otherwise random.
In our case, the data are given by a pair of time series. We construct pairs of surrogate time series that have the amplitude distribution and auto-correlation of the individual time series, as well as the cross-correlation between the time series, in common with the original pair of time series. All other properties of the time series, including possible signatures of the asymmetric state similarity conditioning between nonlinear dynamics, are destroyed by randomization. This corresponds to the null hypothesis that “the dynamics is a stationary bivariate linear stochastic correlated Gaussian process. The measurement functions by which the pair of time series was derived from the dynamics are invertible but potentially nonlinear. The auto-correlation, cross-correlation, mean, and variance of the underlying Gaussian process are such that the measurement results in the auto-correlation, cross-correlation, and amplitude distribution of the observed time series.”26
To generate the surrogates, we use the bivariate iterative amplitude adjusted Fourier transform algorithm described in Ref. 28. In line with Ref. 28, our choice to use this particular surrogate algorithm is motivated by a previous study of EEG recordings from epilepsy patients.37 Apart from the surrogates used here, Ref. 37 tested time-shifted surrogates, which represent the null hypothesis that the dynamics are independent and not cross-correlated. No assumption is made about the nature of individual dynamics. Accordingly, time-shifted surrogates maintain the individual signals’ structure and autocorrelation but not the cross-correlation between signals. Reference 37 showed that a combination of with the surrogates used in the present study allows one for a much better discrimination between focal signals and nonfocal signals than a combination of with time shifted surrogates. It is, therefore, essential to not only preserve the autocorrelation but also the cross correlation of the signals in our study of the pairs of EEG signals. An exemplary pair of surrogate time series constructed for the EEG signals shown in Fig. 6(a) is displayed in Fig. 6(b).
To test , we calculate the measures and for the original pair of time series and a set of 19 independent realizations of pairs of surrogate time series. We reject the null hypothesis if a measure’s value obtained for the original pair of time series is larger than for any of its surrogates, corresponding to a one-sided rank-order test with significance level .28
C. EEG analysis and results
Unlike the analysis in Ref. 26, we did not downsample the data by a factor of four. We, therefore, took a four times larger time embedding window and autocorrelation exclusion window . Thus, we use embedding parameters , , and . We use as we did in Sec. III since it is the temporal distance between the first and the last entry in the embedding vectors that matters. The algorithm parameters are . Figures 6(c) and 6(d) show the cross-rank vectors obtained for the exemplary pair of EEG signals [Fig. 6(a)] along with the vectors obtained for surrogate signals pairs [one of which is shown in Fig. 6(b)]. Figures 6(e)–6(h) show the corresponding and indices.
Recall that the signal pairs of the Bern–Barcelona EEG database were selected randomly from different patients, times, and brain areas. Moreover, the assignment to and was random for each pair. Hence, the directions and carry no particular meaning for this real-world data. Therefore, the authors of Ref. 26 did not consider the aspect of direction and used a symmetrized measure to quantify the overall strength of the interdependence: . In order to compare our results to the ones of Ref. 26, we adopt this approach and define .
. | All . | Stationary . | ||||
---|---|---|---|---|---|---|
. | pf . | pn . | D . | pf . | pn . | D . |
LS | 0.511 | 0.370 | 0.160 | 0.462 | 0.264 | 0.273 |
cr,S | 0.107 | 0.056 | 0.313 | 0.107 | 0.058 | 0.297 |
. | All . | Stationary . | ||||
---|---|---|---|---|---|---|
. | pf . | pn . | D . | pf . | pn . | D . |
LS | 0.511 | 0.370 | 0.160 | 0.462 | 0.264 | 0.273 |
cr,S | 0.107 | 0.056 | 0.313 | 0.107 | 0.058 | 0.297 |
To study the influence of non-stationarity on the proportions of rejections of , Ref. 26 introduced a surrogate-based test for the temporal variability of the signals’ amplitude distribution, power spectrum, and cross-correlation between signals. The test was designed to be very strict, as a rejection with any one of the three features was considered a rejection of the stationarity test. Using this test, and focal signal pairs were classified as stationary and non-stationary, respectively. For the nonfocal signal pairs, these numbers were found to be and , respectively. These classification results are publicly available along with the EEG data.38 The last three columns of Table I show the proportions of rejections we obtained exclusively for the subsets of stationary signals along with the resulting contrasts .
For , the proportions of rejections decrease substantially when excluding the non-stationary signal pairs. This holds especially for nonfocal signals. Consequently, the contrast of increases from to . Conversely, for , only the proportion of rejections for the nonfocal signals increases slightly, resulting in a small decrease from to . This shows that is much less sensitive to non-stationarity. Accordingly, with or without the exclusion of non-stationary signal pairs, the contrast between focal and nonfocal signals is higher for than for .
V. CONCLUSION
In this paper, we considered the problem of inferring the strength and the direction of couplings from measured bivariate time series. This work is a continuation of Ref. 24, in which we introduced a new state-space-based directional coupling measure , which is derived from the tails of the cross-distance vectors . Expanding on this concept, we here introduced two novel measures and . They are obtained by quantifying the tails of the cross-rank vectors and the cross-estimate vectors , respectively.
We first studied the direction-sensitivity and the direction-specificity of the cross-vector measures , , and , the nonlinear interdependence measures 21 and ,22 and the convergent cross mapping measure .23 Our analysis of simulated coupled dynamics shows that all three cross-vector measures allow for considerably more reliable detection of the coupling direction for a wider range of dynamics compared to the other measures. This higher direction-specificity is the main advantage of this cross-vector approach. Furthermore, while the convergent cross mapping measure demonstrates slightly higher direction-sensitivity than the other five measures, it also exhibits the largest variance across different realizations of the dynamics. The other five measures have comparable direction-sensitivity. The measure performed at least as well as and in all aspects. The fact that it is computed from ranks provides additional advantages. It can reach values closer to and is more robust to asymmetries between and since the distributions of ranks are uniform for all time series. Therefore, also allows for more equitable testing with surrogates than or , as the distributions of ranks are identical for both original and surrogate time series. This does not hold for the distributions of distances or cross-correlation values between the original and estimated time series. Hence, we consider as the most promising cross-vector measure and, thus, studied its performance further. In particular, we compared it to the established rank-based measure to ensure a fair evaluation, given that both measures are rank-based.
We investigated the impact of measurement noise on the two chosen measures. As noise levels increase, the measure retains better direction-specificity, while preserves better direction-sensitivity. Overall, allows for a correct inference of the coupling direction across a substantially wider range of coupling strength than for noisy time series. This shows that is a better choice when discerning the coupling direction from noisy data. Furthermore, we explored the performance of the rank-based measures on short time series. We found that slightly outperformed in terms of direction-sensitivity in very short time series. However, was once again found to be substantially more direction-specific than for all time series lengths considered. Accordingly, is more reliable in inferring the correct coupling direction also for short time series. Additionally, we analyzed the signals generated by the linear mixing of two independent stochastic dynamics, which allowed us to introduce an arbitrary degree of linear cross-correlation to the pair of time series. Our results show that is substantially less affected by linear cross-correlation than . We conjecture that this is the key factor contributing to the better direction-specificity of the cross-vector measure. However, this is not the sole factor, as further analysis reveals that is significantly more direction-specific than even for two nonlinearly coupled stochastic dynamics with zero linear cross-correlation. This is an important finding, suggesting that the measure is unaffected by multiple properties of the time series that are not directly related to directional coupling.
Finally, we applied the rank-based measures and to the Bern–Barcelona EEG database,26 a publicly available database of EEG signals from epilepsy patients. Previous studies using state-space-based measures have shown that focal signals exhibit a larger degree of nonlinear interdependence compared to nonfocal signals during the seizure-free interval.18,26,37,39 In this paper, we build upon the study in Ref. 26, which used the measure combined with bivariate surrogates, by including the measure . Our analysis shows that can better distinguish between focal and nonfocal signals compared to , as shown by a larger contrast . Excluding those signals classified as non-stationary increases the contrast of close to that of . Conversely, the results for remain nearly unchanged. This is an important finding, as it reveals that is much less influenced by non-stationarity in the analyzed time series compared to .
An important point needs to be made about bidirectional couplings. Recall that state-space-based measures were introduced using the asymmetric state similarity conditioning principle, which enables them to quantify the strength of unidirectional couplings in driver-response dynamics. In practice, they can also accurately quantify the influence of both directional couplings and when the coupling is bidirectional.24 However, one should be cautious about asymmetries between the two systems and in such scenarios, as these measures quantify the coupling impact rather than the coupling strength parameter from the governing equations.40
In conclusion, the new measure demonstrates substantially improved accuracy in inferring the coupling direction, compared to the established state-space approaches we considered. To specifically infer the direction of coupling, has proven to be the most reliable choice across the various settings we examined. Additionally, it shows great potential for applications to real-world data, as illustrated by our study of focal and nonfocal EEG signals from patients with epilepsy. Furthermore, its robustness to non-stationarity reflected in time series eliminates the need to exclude such signals, enabling more effective use of valuable data. In closing, our study offers a new perspective on utilizing the asymmetric state similarity conditioning principle for the detection of directional couplings, which provides an interesting theoretical insight. We hope this advancement will spark further research and inspire additional investigations in this area.
ACKNOWLEDGMENTS
M.B. and P.B. acknowledge the Research Core Funding No. P2-0001 and the Research Project No. J3-4525, both financially supported by the Slovenian Research and Innovation Agency. R.G.A. acknowledges funding from the Spanish Ministry of Science and Innovation and the State Research Agency (Grant No. PID2020-118196GBI00/MICIU/AEI/10.13039/501100011033).
AUTHOR DECLARATIONS
Conflict of Interest
The authors have no conflicts to disclose.
Author Contributions
Martin Brešar: Conceptualization (supporting); Formal analysis (lead); Investigation (lead); Methodology (lead); Software (lead); Validation (equal); Visualization (lead); Writing – original draft (equal). Ralph G. Andrzejak: Conceptualization (lead); Data curation (lead); Methodology (supporting); Supervision (equal); Validation (equal); Writing – original draft (equal). Pavle Boškoski: Investigation (supporting); Resources (lead); Software (supporting); Supervision (equal); Writing – original draft (supporting).