Weak ties play a significant role in the structures and the dynamics of community networks. Based on the contact process, we study numerically how weak ties influence the predictability of epidemic dynamics. We first investigate the effects of the degree of bridge nodes on the variabilities of both the arrival time and the prevalence of disease, and find out that the bridge node with a small degree can enhance the predictability of epidemic spreading. Once weak ties are settled, the variability of the prevalence will display a complete opposite trend to that of the arrival time, as the distance from the initial seed to the bridge node or the degree of the initial seed increases. More specifically, the further distance and the larger degree of the initial seed can induce the better predictability of the arrival time and the worse predictability of the prevalence. Moreover, we discuss the effects of the number of weak ties on the epidemic variability. As the community strength becomes very strong, which is caused by the decrease of the number of weak ties, the epidemic variability will change dramatically. Compared with the case of the hub seed and the random seed, the bridge seed can result in the worst predictability of the arrival time and the best predictability of the prevalence.

In community networks, the links that connect pairs of nodes belonging to different communities are defined as weak ties. The weak ties hypothesis, which is first proposed by Granovetter, is a central concept in the social network analysis. Weak ties not only play a role in effecting social cohesion but also are helpful for stabilizing complex systems under most conditions. Most recent research results showed that weak ties have significant impacts on spreading dynamics. But until now, no study on the effects of weak ties on the predictability of epidemic dynamics has been given to us. In this study, we investigate how the degree of bridge nodes and the number of weak ties influence the predictability of the epidemic dynamics on a local community. We show numerically that both the degree of bridge nodes and the network modularity are crucial in the predictability of the epidemic spreading on the local community. More importantly, we find out that the variability of the arrival time always displays a complete opposite trend to that of the prevalence, which implies that it is impossible to predict the epidemic patterns in the early stage of outbreaks accurately. This work provides us further understanding and new perspective in the effect of weak ties on epidemic spreading.

Community structures at mesoscale level are ubiquitous in a variety of real-world complex systems,1 such as Facebook,2 YouTube,3 and Xiaonei.4 In general, there are more connections between members in the same community than that between members from different communities, where the links that connect pairs of nodes belonging to different communities are defined as weak ties.5–7 The weak ties hypothesis, which is first proposed by Granovetter,8 is a central concept in the social network analysis.9,10 Weak ties not only play a role in effecting social cohesion11 but also are helpful for stabilizing complex systems under most conditions.12 Recently, weak ties have been shown to destabilize ecosystems under specific conditions.13 

Epidemic spreading,14–17 a fundamental dynamical process, is one of the most important subjects in the complex network theory.18–22 Inspired by the significant effects of weak ties on the dynamics on networks,12 many recent works have contributed to understanding the interplay between weak ties and spreading dynamics on community networks.23–27 Onnela et al. found that weak ties can significantly slow the diffusion process, leading to the dynamic trapping of information on communities.28,29 As weak ties are removed gradually, the coverage of information will drop sharply.30 On adaptive networks, strong communities with weak ties may prevent disease propagation.31,32

In order to assess the accuracy and the forecasting capabilities of numerical models, the predictability of outbreaks has been investigated in many studies.33–37 Colizza et al. studied the effect of airline transportation network on the predictability of epidemic patterns by means of the normalized entropy function38 and found that the heterogeneous weight distribution contributes to enhancing the predictability. Crépey et al. found that initial conditions such as the degree heterogeneity of the initial seed can induce a large variability on the prediction of the prevalence.39 Loecher and Kadtke argued that the random walk centrality (RWC) serves as a better index than degree to predict the prevalence of disease.40 Comparing the scale-free network (SFN) with community structure with the random SFN, the predictability of its global prevalence was found to be better.41 Considering the relative independence of a local community, Gong et al. investigated the prevalence and its variability on the local community, and found that the extraordinarily large variability in the early stage of outbreaks made the prediction of epidemic spreading hard.42 In addition, Zhao et al. studied how the heterogeneous time delay (HETD) associated with geographical distance influences the spreading speed and the variability of the prevalence. Owing to the correlations between time delay and network hierarchy in the HETD, the epidemic spreading is slowed down obviously and the predictability of the prevalence is reduced remarkably.43 

On community networks, as mentioned above, weak ties play a very significant role in epidemic dynamics. But until now, there is no study on the effects of weak ties on the predictability of epidemic dynamics. In this paper, we investigate how weak ties influence the predictability of the epidemic dynamics on community networks. We show numerically that both the degree of bridge nodes and the number of weak ties can remarkably influence the predictability of the epidemic spreading on a local community. More importantly, we find out that the variability of the arrival time always displays a complete opposite trend to that of the prevalence, which implies that it is impossible to predict the epidemic spreading in the early stage of outbreaks accurately.

The paper is organized as follows. In Sec. II, we briefly describe the dynamical process on a community network and introduce the quantitative measurements of predictability. In Sec. III, we investigate the effects of the degree of bridge nodes on the predictability of the dynamics. In Sec. IV, the effects of the number of weak ties on the predictability are analyzed. Finally, we draw conclusions in Sec. V.

To investigate the effects of weak ties on the predictability of epidemic dynamics, we must first identify which links on community networks are weak ties. Unfortunately, there is not a generally accepted and authenticated community detection algorithm;18,48,49 thus, it is difficult to identify weak ties accurately on real-world networks. We here consider a community network model comprised two confined communities A and B. Except for the community structure, degree heterogeneity is another important feature of real-world community networks.44–47 In this study, we focus on the community network with degree heterogeneity. To be specific, two independent Barabási-Albert (BA) scale-free networks50,51 with the same size are first produced, and then these two networks are connected by few links. In order to normalize the terms of community network, we define the links between two communities as weak ties,28 and call the nodes connected by these weak ties bridge nodes.11,52 Obviously, the network has a strong community structure because of few weak ties. With the increase of the number of weak ties, the community structure will be weakened.

The network modularity Q, a popular evaluating indicator in measuring the community structure,1 is defined as

(1)

where ls and ds represent the number of intra-links and the sum of degrees of the nodes in community s, respectively; L denotes the total link number in the network; and C is the number of communities. Here, 0Q1, the larger Q is, the stronger community structure is. However, the Q cannot accurately characterize the community strength of a network with two communities.32 To address the shortcoming, the normalized Qn is defined as

(2)

where Qrand corresponds to the random network with the same degree sequence, and Qmax is the modularity of the network without inter-community links. After the normalization, Qn is in range [0, 1].

In many real-world spreading processes, each node's potential infection-activity is limited, and is not strictly equal to its degree.53 For epidemic spreading on human contact networks, although a hub individual has many acquaintances, he/she cannot contact all his/her acquaintances once within one time step.54,55 Thus, a contact process (CP) model with identical infectivity is proposed to study the epidemic spreading on complex networks,55–65 where each infected node can only contact one of its neighbors per unit time. For simplicity, we only study susceptible-infected (SI)15 spreading dynamics through numerical simulations, which is a classical model of the CP. In the model, “S” and “I” represent, respectively, the susceptible (or healthy) and the infected states. At the beginning, a node is selected as the initial infected (i.e., seed) and all other nodes are in “S” state. At each time step, each infected node randomly contacts one of its neighbors, and then the contacted neighboring node will be infected with probability λ if it is in the healthy state, or else it will retain its state. Once an individual is infected, it will keep its state forever. To eliminate the stochastic effect of disease propagation, we set λ=1.

Although other disease models such as susceptible-infected-recovered (SIR) are practically more relevant and realistic, the inclusion of more processes and parameters such as the recovery rate μ in SIR model complicates the analysis of the problem considered here. For example, in epidemic spreading with unlimited infectivity, the initial seed with a larger degree will result in a higher prevalence due to the existence of recovery rate in SIR model.66 However, the simple SI model can still be adopted to well describe the early dynamics of epidemic outbreaks, such as the acquired immune deficiency syndrome (AIDS), gonorrhea, and syphilis.15 Owing to the SI model, the effects of different contact patterns on epidemic spreading can be clearly understood.

In view of the relative independence of a local community, we take the dynamics and its variability into account. When a disease emerges on the community network, it is very important for a local community to keep a watchful eye on two statistical parameters: the arrival time and the prevalence of disease, where the arrival time of disease ta is defined as the moment that infectious individual first occurs on the community in each realization, and the prevalence i(t) is the density of infected individuals at time t. In order to investigate the predictability of epidemic dynamics, the variability of the arrival time (the prevalence) is defined as the relative variation of the arrival time (the prevalence) given by39,42,43

(3)

and

(4)

Δ(ta)=0(Δ[i(t)]=0) denotes that all independent dynamic realizations are essentially the same, and the arrival time (the prevalence) on the network is deterministic. The larger Δ(ta)(Δ[i(t)]) means the worse predictability that a particular realization is far from average over all independent realizations.

Because of the degree heterogeneity of the community network, there may be different degrees of bridge nodes, that is, a pair of bridge nodes (i.e., bA and bB) connected by a weak tie may have different degrees kbA and kbB. In the CP, weak ties with the different combinations of kbAkbB have different effects on the propagation from the first community A to the second community B. As a first step towards this, we consider the case of one weak tie between two communities. Reference 42 showed that once the spreading starts on the second community, the seed is irrelevant. That is to say, different kbB hardly affect the epidemic spreading on the second community B. Therefore, by varying the kbA only, we investigate how different kbA influence the predictability of the epidemic dynamics on the second community B. In simulations, a node with the fixed degree kbA in the first community A is connected with a randomly chosen node in the second community B.

Fig. 1 shows the mean arrival time ta and its variability Δ(ta) on the second community B when different degrees of bridge nodes are created. When the bridge node bA of the first community A is chosen as the seed, that is d = 0 (d is the geodesic distance from the initial infected node to the bridge node bA), the mean arrival time ta is linear with the degree of the bridge node kbA. As the bridge node of the second community bB is one neighbor of the bridge node bA, it will be infected with probability 1/kbA at each time step, which is obviously a Poisson process. So the mean arrival time ta is equal to kbA and its relative variation is σ(ta)/ta1. When the disease seed is a node with one step to the bridge node bA (i.e., d = 1), the mean arrival time ta for different kbA will increase compared with the case of d = 0. For a small kbA,ta is significantly greater than that of d = 0, e.g., ta(d=1)15ta(d=0)5 for kbA=5. But for a large kbA, the relative change of ta is very little, e.g., ta(d=1)248>ta(d=0)240 for kbA=242. The reason is that the infection of the bridge node bB can be divided into two processes: The bridge node bA is first infected in t110 (t1 is the time duration of the first process), e.g., ta(d=1)ta(d=0)10 for kbA=5 and ta(d=1)ta(d=0)8 for kbA=242, and then the bridge node bB is infected by the bridge node bA in t2kbA (t2 is the time duration of the second process). With the further increasing of the distance from the seed to the bridge node bA (such as d = 3 and 4), the mean arrival times ta are nearly the same for the fixed kbA. It can be understood that owing to the finite size effect of the network with the average shortest path length L3.7, the bridge node bA is infected till an overall outbreak emerges on the first community A.

FIG. 1.

The mean arrival time ta and its variability Δ(ta) as a function of the degree of the bridge node kbA where the “squares,” “circles,” “triangleups,” “triangledowns,” and “diamonds” denote the cases of the seeds with d = 0, 1, 2, 3, and 4, respectively. (a) ta versus kbA, (b) Δ(ta) versus kbA. The parameters are chosen as NA=NB=0.5×104,k=10,andλ=1. We perform the experiments on 103 different networks, each of which are tested in 103 independent realizations.

FIG. 1.

The mean arrival time ta and its variability Δ(ta) as a function of the degree of the bridge node kbA where the “squares,” “circles,” “triangleups,” “triangledowns,” and “diamonds” denote the cases of the seeds with d = 0, 1, 2, 3, and 4, respectively. (a) ta versus kbA, (b) Δ(ta) versus kbA. The parameters are chosen as NA=NB=0.5×104,k=10,andλ=1. We perform the experiments on 103 different networks, each of which are tested in 103 independent realizations.

Close modal

In Fig. 1(b), the variabilities of the arrival time Δ(ta) for different kbA are shown. When d = 0, Δ(ta) for different kbA are approximately equal to 1 because these infections are Poisson processes. Interestingly, compared with the case of d = 0, Δ(ta) for d = 1 decreases at each kbA, that is to say the further distance to the initial seed can lead to the better predictability of the arrival time. As mentioned above, the infection of the bridge node bB has two processes, and thus its variability Δ(ta) can be written as

(5)

where t1 and t2 denote the time durations of the first process and the second process, respectively. Substituting it into Eq. (3), we obtain

(6)

where D(t1+t2)=(t1+t2)2t1+t22. Considering the independence of these two processes, Eq. (6) is reduced to

(7)

where D(t1) and D(t2) are the time variances of the first process and the second process, respectively.

In the first process, there are two basic spreading pathways through which the bridge node bA may be infected. As shown in Fig. 2, the bridge node may be infected directly by the initial seed (i.e., a neighboring node of the bridge node bA) with probability 1/ks; the other route is an indirect transmission of infection from its neighboring nodes except the seed when the overall outbreak occurs on the first community A. Although the first route is a Poisson process, the variability Δ(t1) in t1 will be less than 1 due to the approximate determinacy of the second pathway. In the second process, the variability Δ(t2)1 because the infection in t2 is a Poisson process. As D(t)=[Δ(t)t]2, we have

(8)

Obviously, Δ(ta) must be less than 1 when d = 1. As t2=kbA increases with kbA,Δ(ta) will also increase according to Eq. (8), which is verified by the results in Fig. 1(b). Especially for a very large kbA, the variability is very close to 1. It means that although the large degree of the bridge node can delay the mean arrival time of disease, it causes the worst predictability of the arrival time. When d2,Δ(t1) of the first process will be more determined. Thus, Δ(ta) for a small kbA will become small, e.g., Δ(ta)0.31(d=4)<Δ(ta)0.52(d=1) for kbA=5. The above results demonstrate that when the degree of the bridge node is small, the further distance of the initial seed to the bridge node can result in a better predictability of the arrival time due to the approximate determinacy of the first process, while the variability Δ(ta)1 is almost not affected for the very large kbA because of the Poisson property of the second process.

FIG. 2.

Two spreading pathways through which the bridge node may be infected. The first one is a direct transmission from the seed to the bridge node, and the second one is an indirect transmission from the seed to node i, j, and then to the bridge node.

FIG. 2.

Two spreading pathways through which the bridge node may be infected. The first one is a direct transmission from the seed to the bridge node, and the second one is an indirect transmission from the seed to node i, j, and then to the bridge node.

Close modal

Next, we focus on the statistical parameter Δ[i(t)]. Reference 42 showed that the variability of the prevalence on a local community is very large at the beginning of outbreaks due to the uncertain arrival time of disease, which makes the prediction of the prevalence hard. For this reason, we pay attention to the variability of the prevalence in the early stage of outbreaks. In Fig. 3(a), the mean prevalence i(T) at T = 20 decreases with kbA and d, which is a complete opposite of the trend of ta in Fig. 1(a). But Fig. 3(b) shows that its variability increases with kbA and d, which is in accordance with the trend of ta. On the one hand, as the very large variability of the prevalence in the early stage is originated from the uncertain arrival time of disease,42 it is difficult for the large kbA (corresponding to the large ta) to make sure the arrival of disease within T = 20. It will result in the large variability of the prevalence, e.g., Δ[i(T)]5.40(d=0) for kbA=242. On the other hand, the further distance of the initial seed to the bridge node bA also makes the disease more difficult to arrive at the bridge node bB, and can thus cause larger Δ[i(T)], e.g., Δ[i(T)]5.60(d=4)Δ[i(T)]0.60(d=0) for kbA=5. The results at other T values such as 5, 10, and 30 confirm the conclusion. These results imply that both the large degree of bridge node and the further distance of the initial seed can make the prediction of the prevalence very hard.

FIG. 3.

At T = 20, the mean prevalence i(T) and its variability Δ[i(T)] as a function of the degree of the bridge node kbA where the “squares,” “circles,” “triangleups,” “triangledowns,” and “diamonds” denote the cases of the seeds with d = 0, 1, 2, 3, and 4, respectively. (a) i(T) versus kbA, (b) Δ[i(T)] versus kbA. The parameters are chosen as NA=NB=0.5×104,k=10,andλ=1. We perform the experiments on 102 different networks, each of which are tested in 103 independent realizations.

FIG. 3.

At T = 20, the mean prevalence i(T) and its variability Δ[i(T)] as a function of the degree of the bridge node kbA where the “squares,” “circles,” “triangleups,” “triangledowns,” and “diamonds” denote the cases of the seeds with d = 0, 1, 2, 3, and 4, respectively. (a) i(T) versus kbA, (b) Δ[i(T)] versus kbA. The parameters are chosen as NA=NB=0.5×104,k=10,andλ=1. We perform the experiments on 102 different networks, each of which are tested in 103 independent realizations.

Close modal

From Subsection III A, we know that the degree heterogeneity of the bridge node bA has a significant impact on the predictability of epidemic dynamics. Here, we investigate the effects of different degrees of the initial seed on the variability of the epidemic dynamics on the second community B when the distance between the initial seed and the bridge node bA is d = 1. As shown in Figs. 4 and 5, when kbA=5 is small, Δ(ta) and Δ[i(T)] are obviously affected by the degree of the seed, while there is almost no effect on the variability when kbA=242 is large.

FIG. 4.

When the distance between the initial seed and the bridge node d = 1, the mean arrival time ta and its variability Δ(ta) as a function of the degree of the initial seed, where ta versus k for kbA=5 (a) and kbA=242 (b), Δ(ta) versus k for kbA=5 (c) and kbA=242 (d). The results are averaged over 102×103 independent realizations on 102 networks.

FIG. 4.

When the distance between the initial seed and the bridge node d = 1, the mean arrival time ta and its variability Δ(ta) as a function of the degree of the initial seed, where ta versus k for kbA=5 (a) and kbA=242 (b), Δ(ta) versus k for kbA=5 (c) and kbA=242 (d). The results are averaged over 102×103 independent realizations on 102 networks.

Close modal
FIG. 5.

When d = 1, the mean prevalence i(T) and its variability Δ[i(T)] at T = 20 as a function of the degree of the initial seed, where i(T) versus kfor kbA=5 (a) and kbA=242 (b), Δ[i(T)] versus k for kbA=5 (c) and kbA=242 (d). The results are averaged over 102×103 independent realizations in 102 networks.

FIG. 5.

When d = 1, the mean prevalence i(T) and its variability Δ[i(T)] at T = 20 as a function of the degree of the initial seed, where i(T) versus kfor kbA=5 (a) and kbA=242 (b), Δ[i(T)] versus k for kbA=5 (c) and kbA=242 (d). The results are averaged over 102×103 independent realizations in 102 networks.

Close modal

When kbA=5, the large degree of the initial seed will result in a large ta (Fig. 4(a)) and a small Δ(ta) (Fig. 4(c)). Owing to the finite contact ability of the initial seed with the large degree, it will cost more time to infect the bridge node bA in the first process, that is the large ta. In the process, the bridge node bA may be infected through two basic pathways. For the initial seed with a small degree, the bridge node bA is infected directly with higher probability. Thus, the first process introduces a larger Δ(t1) because of the randomness of the Poisson process, and thus Δ(ta) in the whole process increases according to Eq. (8). In addition, Fig. 5(c) shows that the variability of the prevalence increases with the degree of the initial seed, which is consistent with the trend of the mean arrival time in Fig. 4(a). We should note that the degree of the initial seed has an opposite effect on the variabilities of the arrival time and the prevalence, which may bring about a great trouble for the pandemic prevention and control.

When kbA=242, the whole infection process is dominated by the second Poisson process (i.e., very large t2 in Eq. (8)). Therefore, the different seeds with d = 1 cannot affect the variability of epidemic dynamics visibly (see Figs. 4(b), 4(d), 5(b), and 5(d)). However, the very large Δ(ta)0.90 and Δ[i(T)]2.50 mean that it is difficult to accurately forecast the epidemic spreading when the degree of the bridge node is large. In order to ensure the universality of the above results, other kbA are also used to simulate the infection process. As expected, all simulations reveal the same conclusion.

In real-world community networks with the modularity Q[0.3,0.7],1 there are many weak ties between communities. In this section, we would like to understand the effects of the number of weak ties on the predictability of epidemic dynamics. To gain a clear idea of the relation between the modularity Q and the number of weak ties, Eq. (1) is expanded as

(9)

where lAB represents the number of weak ties between the community A and the community B. When the network is connected randomly, lAB(dAdB)/2L, thus Qrand0; When lAB=0 and dA=dB, Q reaches the maximum value, i.e., Qmax=0.5. Therefore, Q can only range from 0 to 0.5. Substituting Qrand=0 and Qmax=0.5 into Eq. (2), we have

(10)

After the standardization, Qn can range from 0 to 1. By adding the number of weak ties lAB between two communities randomly, we can obtain the community networks with different Qn.

Fig. 6 shows the case of three kinds of the initial seed: bridge node, random node, and hub. In Fig. 6(a), with the increase of Qn, the mean arrival time ta will increase because fewer weak ties lengthen the distance between two communities. Especially when Qn0.9, the mean arrival time ta will increase rapidly. Compared with the other two cases, the case of the bridge node chosen as the initial seed has the shortest ta. The case of the random node includes the cases of the bridge node and the non-bridge node, so ta for the random seed must be longer than that for the bridge seed. For the case of the hub, it has the longest mean arrival time. As the nodes connected by weak ties are chosen randomly, the nodes with a small degree will be more probably chosen as bridge nodes due to the degree heterogeneity, while it is very difficult for the hubs to be bridge nodes. When the degree of the bridge node is small, the initial seed with a large degree leads to the longer ta (see Fig. 4(a)).

FIG. 6.

The mean arrival time ta and its variability Δ(ta) as a function of the modularity Qn where the “squares,” “circles,” and “triangles” denote the cases of the bridge seed, the random seed, and the hub seed, respectively. (a) ta versus Qn, (b) Δ(ta) versus Qn. The parameters are chosen as NA=NB=0.5×104,k=10,andλ=1. We perform the experiments on 102 different networks, each of which are tested in 103 independent realizations.

FIG. 6.

The mean arrival time ta and its variability Δ(ta) as a function of the modularity Qn where the “squares,” “circles,” and “triangles” denote the cases of the bridge seed, the random seed, and the hub seed, respectively. (a) ta versus Qn, (b) Δ(ta) versus Qn. The parameters are chosen as NA=NB=0.5×104,k=10,andλ=1. We perform the experiments on 102 different networks, each of which are tested in 103 independent realizations.

Close modal

Moreover, the variabilities of the arrival time Δ(ta) for these three cases are compared in Fig. 6(b). As mentioned in Fig. 1(b), the further distance of the initial seed to the bridge node bA can reduce the variability Δ(ta). With the increase of Qn, the distance between two communities is lengthened by fewer weak ties, and thus Δ(ta) will decrease. For example, when Qn increases from 0.9 to 1, Δ(ta) for the random case decreases from 0.33 to 0.18 rapidly. For the case of the hub, its arrival time has the most accurate predictability. It is because that the initial seed with a large degree can lead to a low variability when the bridge node bA has a small degree (see Fig. 4(c)). More significant, Δ(ta) for the case of the bridge node increases with the network modularity Qn, which is opposite to the other two cases. Even though a bridge node biA is chosen as the initial seed, the infection to community B is not always through the weak tie of the bridge node biA because of the existence of many other weak ties. That is, there are two optional spreading pathways towards the community B: the weak tie of biA and the other weak ties. Thus, the actual path length of epidemic spreading must be greater than 1. In other words, more links will result in the longer distance of the infection process. Therefore, more weak ties can reduce Δ(ta) due to the increase of the distance between the community A and the community B. Actually, more weak ties can increase the deterministic of the second optional pathway, and thus enhances the predictability of the arrival time.

Furthermore, the effects of the number of weak ties on the predictability of the prevalence in the early stage of outbreaks are also analyzed in Fig. 7. As the mean arrival time increases with Qn in Fig. 6(a), the prevalence i(T) at T = 2 will decrease accordingly in Fig. 7(a). i(T)(hub)<i(T)(random)<i(T)(bridgenode) is resulted from ta(hub)>ta(random)>ta(bridgenode). Fig. 7(b) shows that the variability of the prevalence Δ[i(T)] increases with Qn. For the case of the bridge node, the change of Δ[i(T)] is very little, which means that the bridge node plays a significant role in enhancing the predictability of the prevalence.42 For the cases of the randomly chosen node and the hub, Δ[i(T)] increases slowly when Qn[0.3,0.9), while Δ[i(T)] increases rapidly when Qn[0.9,1) (e.g., Δ[i(T)]13.10 for Qn0.99), which is in accordance with the trend of ta. The results at other T values (e.g., T = 5 and 10) reveal the same conclusion. It implies that the strong community structure can increase the difficulty of the predictability of the prevalence.

FIG. 7.

At T = 2, the mean prevalence i(T) and its variability Δ[i(T)] as a function of the modularity Qn where the “squares,” “circles,” and “triangles” denote the cases of the bridge seed, the random seed, and the hub seed, respectively. (a) i(T) versus Qn, (b) Δ[i(T)] versus Qn. We perform the experiments on 102 different networks, each of which are tested in 103 independent realizations.

FIG. 7.

At T = 2, the mean prevalence i(T) and its variability Δ[i(T)] as a function of the modularity Qn where the “squares,” “circles,” and “triangles” denote the cases of the bridge seed, the random seed, and the hub seed, respectively. (a) i(T) versus Qn, (b) Δ[i(T)] versus Qn. We perform the experiments on 102 different networks, each of which are tested in 103 independent realizations.

Close modal

In conclusion, we have studied the effects of weak ties on the predictability of the epidemic dynamics on a heterogeneous network with local community structure. First, we have shown that the degree of bridge nodes can remarkably influence the variabilities of both the arrival time and the prevalence of disease. With the increase of the degree of the bridge node, the mean arrival time and the outbreak (i.e., the prevalence) of disease will be delayed, but their variabilities will still increase because of the Poisson property of the transmission from the bridge node bA to bB.

Second, we have shown that the distance between the initial seed and the bridge node, as well as the degree of the initial seed when d = 1, has different impacts on the epidemic predictability under different conditions. When the degree of the bridge node is large, the variability of the arrival time is close to 1, and the variability of the prevalence is very large. When the degree of the bridge node is small, the further distance of the initial seed to the bridge node (or the larger degree of the initial seed when d = 1) will enhance the predictability of the arrival time due to the approximate determinacy of the indirect transmission in infecting the bridge node bA, while the predictability of the prevalence in the early stage will get worse due to the uncertain arrival time of disease. These results suggest that the bridge node with a small degree may be an important detection station for the epidemic control on community networks. Taking global epidemics on the airline network, for example,38,67 the slackened precaution at a small international airport may bring a lot of trouble to the prediction and control of disease.

Moreover, we also have analyzed the effects of the number of weak ties on the epidemic predictability where the results of three different initial seeds (i.e., bridge node, random node, and hub) were compared. With the increase of the network modularity, the variabilities of the arrival time for the case of the random node and the hub will decrease slowly when Qn[0.3,0.9], and then drop sharply when Qn is greater than 0.9. It is important to note that the variability of the prevalence will increase rapidly when the community strength is strong enough, which is in accordance with the trend of the mean arrival time. By contrast, owing to the Poisson property of the transmission on weak ties, the variability of the arrival time for the case of the bridge node increases with the network modularity, which displays the worst predictability of the arrival time. But the best predictability of the prevalence is observed when the bridge node is first infected, which is originated from the shortest arrival time for this case. These results suggest that although the strong community structure can delay the infection from one community to another, it can also cause the unpredictability of the epidemic spreading on these communities. It is very important to avoid the infection of bridge nodes in the early stage of outbreaks for a better prediction of the spreading on community networks.

The above results provide us further understanding and new perspective in the effect of weak ties on epidemic spreading. This work only focused on the CP model with limited infectivity, but the other type of epidemic spreading (i.e., epidemic spreading with unlimited infectivity) may change the above conclusions qualitatively. Thus, on a heterogeneous network with local community structure, how weak ties influence the predictability of the unlimited spreading is an interesting question.

This work is supported by the NNSF of China (Grants No. 11105025), China Postdoctoral Science Foundation (Grant No. 20110491705), the Specialized Research Fund for the Doctoral Program of Higher Education (Grant No. 20110185120021), the Fundamental Research Funds for the Central Universities (Grant No. ZYGX2011J056), and China Postdoctoral Science Special Foundation (Grant No. 2012T50711).

1.
M. E. J.
Newman
,
Phys. Rev. Lett.
89
,
208701
(
2002
);
[PubMed]
M. E. J.
Newman
,
Phys. Rev. E
67
,
026126
(
2003
);
M. E. J.
Newman
and
J.
Park
,
Phys. Rev. E
68
,
036122
(
2003
);
M. E. J.
Newman
and
M.
Girvan
,
Phys. Rev. E
69
,
026113
(
2004
).
2.
See http://www.facebook.com for Facebook.
3.
See http://www.youtube.com for Youtube.
4.
See http://www.xiaonei.com for Xiaonei.
5.
H.-H.
Jo
,
R. K.
Pan
, and
K.
Kaski
,
PLoS ONE
6
(
8
),
e22687
(
2011
).
6.
E.
Ferrara
,
P.
De Meo
,
G.
Fiumara
, and
A.
Provetti
, e-print arXiv:1203.0535v1 (
2012
).
7.
P. A.
Grabowicz
,
J. J.
Ramasco
,
E.
Moro
,
J. M.
Pujol
, and
V. M.
Eguiluz
,
PLoS ONE
7
(
1
),
e29358
(
2012
).
8.
M. S.
Granovetter
,
Am. J. Sociol.
78
,
1360
(
1973
).
10.
E. L.
Berlow
,
Nature
398
,
330
(
1999
).
11.
X.-Q.
Cheng
,
F.-X.
Ren
,
H.-W.
Shen
,
Z.-K.
Zhang
, and
T.
Zhou
,
J. Stat. Mech.: Theory Exp.
2010
,
P10011
.
12.
P.
Csermely
,
Weak Links: The Universal Key to the Stability of Networks and Complex Systems
(
Springer
,
Heidelberg, Germany
,
2009
).
13.
T.
Gross
,
L.
Rudolf
,
S. A.
Levin
, and
U.
Dieckmann
,
Science
325
,
747
(
2009
).
14.
N. T. J.
Bailey
,
The Mathematical Theory of Infectious Diseases
, 2nd ed. (
Griffin
,
London
,
1975
).
15.
R. M.
Anderson
and
R. M.
May
,
Infectious Disease of Humans
(
Oxford University Press
,
Oxford
,
1992
).
16.
O.
Diekmann
and
J. A. P.
Heesterbeek
,
Mathematical Epidemiology of Infectious Diseases: Model Building, Analysis and Interpretation
(
Wiley
,
New York
,
2000
).
17.
D. J.
Dailey
and
J.
Gani
,
Epidemic Modelling: An Introduction
(
Cambridge University Press
,
Cambridge
,
2001
).
18.
S.
Boccaletti
,
V.
Latora
,
Y.
Moreno
,
M.
Chavez
, and
D.-U.
Hwang
,
Phys. Rep.
424
,
175
(
2006
).
19.
S. N.
Dorogovtsev
,
A. V.
Goltsev
, and
J. F. F.
Mendes
,
Rev. Mod. Phys.
80
,
1275
(
2008
).
20.
A.
Barrat
,
M.
Barthelmy
, and
A.
Vespignani
,
Dynamical Processes on Complex Networks
(
Cambridge University Press
,
New York
,
2008
).
21.
M. E. J.
Newman
,
Networks: An Introduction
(
Oxford University Press
,
Oxford, UK
,
2010
).
23.
Z.
Liu
and
B.
Hu
,
Europhys. Lett.
72
(
2
),
315
(
2005
).
24.
L.
Huang
,
K.
Park
, and
Y.-C.
Lai
,
Phys. Rev. E
73
,
035103
R
(
2006
).
25.
Y.
Zhou
,
Z.
Liu
, and
J.
Zhou
,
Chin. Phys. Lett.
24
,
581
(
2007
).
26.
G.
Yan
,
Z.-Q.
Fu
,
J.
Ren
, and
W.-X.
Wang
,
Phys. Rev. E
75
,
016108
(
2007
).
27.
X.
Wu
and
Z.
Liu
,
Physica A
387
,
623
(
2008
).
28.
J.-P.
Onnela
,
J.
Saramäki
,
J.
Hyvönen
,
G.
Szabó
,
D.
Lazer
,
K.
Kaski
,
J.
Kertész
, and
A.-L.
Barabási
,
Proc. Natl. Acad. Sci. U.S.A.
104
,
7332
(
2007
).
29.
D.
Centola
and
M.
Macy
,
Am. J. Sociol.
113
,
702
(
2007
).
30.
J. C.
Zhao
,
J. J.
Wu
, and
K.
Xu
,
Phys. Rev. E
82
,
016105
(
2010
).
31.
A.
Mihalik
and
P.
Csermely
,
PLoS Comput. Biol.
7
(
10
),
e1002187
(
2011
).
32.
H.
Yang
,
M.
Tang
, and
H.-F.
Zhang
, e-print arXiv:1205.4352v1 (
2012
).
33.
L.
Hufnagel
,
D.
Brockmann
, and
T.
Geisel
,
Proc. Natl. Acad. Sci. U.S.A.
101
,
15124
(
2004
).
34.
M.
Barthélemy
,
A.
Barrat
,
R.
Pastor-Satorras
, and
A.
Vespignani
,
J. Theor. Biol.
235
,
275
(
2005
).
35.
A.
Gautreau
,
A.
Barrat
, and
M.
Barthélemy
,
J. Stat. Mech.: Theory Exp.
2007
,
L09001
.
36.
A.
Gautreau
,
A.
Barrat
, and
M.
Barthélemy
,
J. Theor. Biol.
251
,
509
(
2008
).
37.
M.
Barthélemy
,
C.
Godrèche
, and
J.-M.
Luck
,
J. Theor. Biol.
267
,
554
(
2010
).
38.
V.
Colizza
,
A.
Barrat
,
M.
Barthlemy
, and
A.
Vespignani
,
Proc. Natl. Acad. Sci. U.S.A.
103
,
2015
(
2006
).
39.
P.
Crépey
,
F. P.
Alvarez
, and
M.
Barthélemy
,
Phys. Rev. E
73
,
046131
(
2006
).
40.
M.
Loecher
and
J.
Kadtke
,
Phys. Lett. A
366
,
535
(
2007
).
41.
W.
Huang
and
C.
Li
,
J. Stat. Mech.: Theory Exp.
2007
,
P01014
.
42.
K.
Gong
,
M.
Tang
,
H.
Yang
, and
M.
Shang
,
Chaos
21
,
043130
(
2011
).
43.
Z.-D.
Zhao
,
Y.
Liu
, and
M.
Tang
,
Chaos
22
,
023150
(
2012
).
44.
J. D.
Han
,
N.
Bertin
,
T.
Hao
,
D. S.
Goldberg
,
G. F.
Berriz
,
L. V.
Zhang
,
D.
Dupuy
,
A. J.
Walhout
,
M. E.
Cusick
,
F. P.
Roth
, and
M.
Vidal
,
Nature
430
,
88
(
2004
).
45.
N. N.
Batada
,
T.
Reguly
,
A.
Breitkreutz
,
L.
Boucher
,
B.-J.
Breitkreutz
,
L. D.
Hurst
, and
M.
Tyers
,
PLoS Biol.
4
,
e317
(
2006
).
46.
N. N.
Batada
,
T.
Reguly
,
A.
Breitkreutz
,
L.
Boucher
,
B.-J.
Breitkreutz
,
L. D.
Hurst
, and
M.
Tyers
,
PLoS Biol.
5
,
e154
(
2007
).
47.
Y.
Wang
,
Y.-Q.
Hu
,
Z.-R.
Di
, and
Y.
Fan
,
Physica A
390
,
4027
(
2011
).
48.
M.
Girvan
and
M. E. J.
Newman
,
Proc. Natl. Acad. Sci. U.S.A.
99
,
7821
(
2002
).
49.
M. E. J.
Newman
,
Nat. Phys.
8
,
25
(
2012
).
50.
A.-L.
Barabási
and
R.
Albert
,
Science
286
,
509
(
1999
).
51.
R.
Albert
and
A.-L.
Barabási
,
Rev. Mod. Phys.
74
,
47
(
2002
).
52.
T.
Nepusz
,
A.
Petróczi
,
L.
Négyessy
, and
F.
Bazsó
,
Phys. Rev. E
77
,
016107
(
2008
).
53.
H.
Toyoizumi
,
S.
Tani
,
N.
Miyoshi
, and
Y.
Okamoto
,
Phys. Rev. E
86
,
021103
(
2012
).
54.
F.
Liljeros
,
C. R.
Rdling
,
L. A. N.
Amaral
,
H. E.
Stanley
, and
Y.
Åberg
,
Nature
411
,
907
(
2001
).
55.
T.
Zhou
,
J.-G.
Liu
,
W.-J.
Bai
,
G.-R.
Chen
, and
B.-H.
Wang
,
Phys. Rev. E
74
,
056109
(
2006
).
56.
C.
Castellano
and
R.
Pastor-Satorras
,
Phys. Rev. Lett.
96
,
038701
(
2006
).
57.
M.
Ha
,
H.
Hong
, and
H.
Park
,
Phys. Rev. Lett.
98
,
029801
(
2007
).
58.
C.
Castellano
and
R.
Pastor-Satorras
,
Phys. Rev. Lett.
98
,
029802
(
2007
).
59.
H.
Hong
,
M.
Ha
, and
H.
Park
,
Phys. Rev. Lett.
98
,
258701
(
2007
).
60.
R.
Yang
,
B.-H.
Wang
,
J.
Ren
,
W.-J.
Bai
,
Z.-W.
Shi
,
W.-X.
Wang
, and
T.
Zhou
,
Phys. Lett. A
364
,
189
(
2007
).
61.
R.
Yang
,
T.
Zhou
,
Y.-B.
Xie
,
Y.-C.
Lai
, and
B.-H.
Wang
,
Phys. Rev. E
78
,
066109
(
2008
).
62.
R.
Yang
,
L.
Huang
, and
Y.-C.
Lai
,
Phys. Rev. E
78
,
026111
(
2008
).
63.
J. D.
Noh
and
H.
Park
,
Phys. Rev. E
79
,
056115
(
2009
).
64.
S. H.
Lee
,
M.
Ha
,
H.
Jeong
,
J. D.
Noh
, and
H.
Park
,
Phys. Rev. E
80
,
051127
(
2009
).
65.
M. A.
Muñoz
,
R.
Juhász
,
C.
Castellano
, and
G.
Ódor
,
Phys. Rev. Lett.
105
,
128701
(
2010
).
66.
Y.
Moreno
,
R.
Pastor-Satorras
, and
A.
Vespignani
,
Eur. Phys. J. B
26
,
521
(
2002
).
67.
R.
Guimerà
,
S.
Mossa
,
A.
Turtschi
, and
L. A. N.
Amaral
,
Proc. Natl. Acad. Sci. U.S.A.
102
,
7794
(
2005
).