For many epidemic networks, some connections between nodes are treated as deterministic, while the remainder are random and have different connection probabilities. By applying spectral analysis to several constructed models, we find that one can estimate the epidemic thresholds of these networks using information from only the deterministic connections. The models also account for generic nonuniform stochastic connections and heterogeneous community structure. The estimation of epidemic thresholds is achieved via inequalities with upper and lower bounds, which are found to be in very good agreement with numerical simulations. Since deterministic connections are easier to detect than stochastic connections, this work provides a feasible and effective method to estimate the epidemic thresholds in real epidemic networks.
In many real complex networks, it is well known that the connections between nodes are neither completely deterministic nor completely stochastic. In general, certain connections are deterministic, while the rest are random. For example, in a human epidemic network, each individual generally has deterministic connections with family, close friends, and relatives, and may have stochastic connections with colleagues, strangers, and so on. Moreover, information characterizing the deterministic connections can be obtained more easily than an adequate description of the stochastic connections: survey data, for example, can provide an accurate picture of the deterministic contacts but not of the stochastic ones. When studying disease propagation, the epidemic threshold (the level of transmission at which a disease transitions from endemic to extinct) is the most important descriptor. Hence, when we study epidemic behavior on a complex network, it is extremely useful if we only need the deterministic connection information to estimate the corresponding epidemic threshold. In this paper, we apply theoretical analysis to several generic models and find that one can estimate the epidemic threshold from the deterministic connections and the stochastic connection probabilities. Numerical simulations are further presented to both demonstrate and validate our results.
I. INTRODUCTION
The analysis of the epidemic threshold is a very important topic in the study of the dynamical behavior of, and control methods for, epidemic spreading on complex networks. Many reported results have shown that the topological structure of an epidemic network plays a vital role in its epidemic threshold. By using the heterogeneous mean-field (HMF) method on a standard SIS epidemic network,1 its epidemic threshold is given by $\beta_c = \langle k \rangle / \langle k^2 \rangle$, where $\langle k \rangle$ and $\langle k^2 \rangle$ are the first and second moments of the network degree distribution. By using linear stability analysis on an SIS Markovian epidemic network, a more exact result is that the epidemic threshold is given by βc = 1/λ1, where λ1 is the largest eigenvalue of the network's adjacency matrix.2–4 With this observation, the influence of the network topological characteristics on the spreading behaviour has been further investigated in depth.5–8 Finally, by embedding more realistic factors into traditional epidemic networks, many epidemic thresholds have been derived on multiplex networks such as epidemic networks with awareness,9–13 traffic-driven epidemic networks,14,15 epidemic networks with community structure,16,17 interconnected epidemic networks,18–20 time-varying epidemic networks,21–23 adaptive epidemic networks,24,25 and so on.
While there is already much work addressing the epidemic thresholds of various epidemic networks, a thorough investigation of the epidemic threshold of an epidemic network with both deterministic and stochastic connections has not yet been done. In a real (social) epidemic network, each individual generally has both deterministic neighboring nodes (e.g., family and relatives) and stochastic neighboring nodes (e.g., colleagues and strangers). In fact, the well-known NW small-world network26 is generated using this same idea to address the transition properties between regular-lattice and random-lattice behavior in social networks. Consequently, the whole network can be divided into two unattached sub-networks: a deterministic network and a stochastic network. To the best of our knowledge, this network division method has not been applied to the study of epidemic transmission on networks or epidemic thresholds. In addition, different people generally have different probabilities of connecting with their stochastic neighboring nodes. Hence, the diversity of connection probability should be considered to obtain more reasonable models (see the case of nonuniform stochastic connections in Sec. IV). For universality, individual awareness and community structure will also be considered in our models. It is well known that many social networks have community structure, where there exists a high density of intra-connections within each community and a lower density of inter-connections between communities. In view of the above factors, to achieve the epidemic threshold estimation, we will first construct several SIS Markovian epidemic networks with deterministic and stochastic connections.
The main objective of this paper is to estimate the epidemic thresholds of these networks. In general, it is impossible to gain global connection information among all nodes in an epidemic network to calculate its exact epidemic threshold, because the network size is very large, the stochastic connections are time-varying, and so on. Therefore, it is natural to raise the following question: Can we estimate the epidemic thresholds of these networks using only the deterministic connection information?
In this paper, based on spectral analysis, we provide a positive answer to this question. By theoretical analysis, we obtain inequalities that give upper and lower bounds for the epidemic thresholds, and these bounds depend only on the topological structure of the deterministic connections and on the stochastic connection probabilities. An optimized upper bound is also developed. Numerical simulations show that these bounds agree well with the computed thresholds.
The rest of this paper is organized as follows. In Sec. II, we give some preliminaries about epidemic network and graph theory. In Secs. III and IV, we estimate the thresholds of an epidemic network with uniform and nonuniform stochastic connection probability, respectively. In Sec. V, an epidemic network with community structure is considered. In Sec. VI, numerical simulations are given to verify the estimations in Secs. III–V. Finally, in Sec. VII, we conclude this paper.
II. PRELIMINARIES
First, we provide some introductory remarks about complex networks and the spectral analysis of graphs.27 The topological structure of a complex network with size n can be represented by a graph G. The graph G, in turn, can be represented by its adjacency matrix A = (aij)n×n, whose elements are either one or zero depending on whether there is a connection between nodes i and j. In this paper, we only consider undirected complex networks, i.e., the adjacency matrix is a real symmetric matrix. We write (i, j) ∈ G if the nodes i and j are connected, i.e., aij = aji = 1; otherwise, aij = aji = 0. It is assumed further that the graph G contains neither self loops (aii = 0) nor multiple links between two nodes. The complement Gc of the graph G consists of the same set of nodes, with (i, j) ∈ Gc if and only if (i, j) ∉ G for i ≠ j. The topological structure of Gc is characterized by its adjacency matrix $A^c = (a^c_{ij})_{n \times n}$: if aij = 1, then $a^c_{ij} = 0$; if aij = 0 and i ≠ j, then $a^c_{ij} = 1$; and $a^c_{ii} = 0$ for all i = 1,2,…, n. It is easy to see that Ac = Jn – In – A, where Jn is the all-one matrix and In is the identity matrix of order n.
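As a minimal sketch of the identity Ac = Jn – In – A, the following NumPy snippet builds the complement adjacency matrix of a small graph. The function name `complement_adjacency` and the four-node path graph are illustrative choices, not part of the original text.

```python
import numpy as np

def complement_adjacency(A):
    """Return Ac = Jn - In - A for a simple undirected graph with adjacency matrix A."""
    n = A.shape[0]
    return np.ones((n, n), dtype=A.dtype) - np.eye(n, dtype=A.dtype) - A

# Illustrative graph: the path 0-1-2-3 on four nodes.
A = np.array([[0, 1, 0, 0],
              [1, 0, 1, 0],
              [0, 1, 0, 1],
              [0, 0, 1, 0]])
Ac = complement_adjacency(A)

# Every off-diagonal pair is in exactly one of G and Gc.
n = A.shape[0]
pairs_covered = np.array_equal(A + Ac, np.ones((n, n), dtype=int) - np.eye(n, dtype=int))
```

The complement keeps a zero diagonal, so neither G nor Gc contains self loops.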
Since the eigenvalues of the adjacency matrix A are real, they can be ordered as λ1(A) ≥ λ2(A) ≥⋯ ≥ λn(A). The largest eigenvalue λ1(A) is also called the spectral radius of the graph. The largest and smallest eigenvalues often appear in the following supremum and infimum forms
$$\lambda_1(A) = \sup_{x \neq 0} \frac{x^T A x}{x^T x}, \qquad \lambda_n(A) = \inf_{x \neq 0} \frac{x^T A x}{x^T x}.$$
Lemma 1. (Ref. 27) For symmetric n × n matrices A, B, it holds that
$$\lambda_k(A) + \lambda_n(B) \le \lambda_k(A + B) \le \lambda_k(A) + \lambda_1(B),$$
where k = 1,2,…, n.
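The two facts above can be checked numerically. The sketch below assumes that the supremum and infimum forms are the usual Rayleigh-quotient characterization, λn(A) ≤ xTAx/xTx ≤ λ1(A), and that Lemma 1 is Weyl's inequality, λk(A) + λn(B) ≤ λk(A + B) ≤ λk(A) + λ1(B). The helper names and the random test graphs are hypothetical.

```python
import numpy as np

def rayleigh_quotient(A, x):
    """Return x^T A x / x^T x for a nonzero vector x."""
    return float(x @ A @ x) / float(x @ x)

def random_adjacency(n, p, rng):
    """Symmetric 0-1 matrix with zero diagonal; each pair is linked with probability p."""
    upper = np.triu(rng.random((n, n)) < p, k=1).astype(float)
    return upper + upper.T

rng = np.random.default_rng(0)
n = 6
A = random_adjacency(n, 0.5, rng)
B = random_adjacency(n, 0.3, rng)

# eigvalsh returns ascending eigenvalues; reverse so that index 0 is lambda_1.
lam_A = np.linalg.eigvalsh(A)[::-1]
lam_B = np.linalg.eigvalsh(B)[::-1]
lam_AB = np.linalg.eigvalsh(A + B)[::-1]

# Rayleigh quotients of random vectors stay inside [lambda_n(A), lambda_1(A)].
quotients = np.array([rayleigh_quotient(A, rng.standard_normal(n)) for _ in range(1000)])

# Weyl-type inequality, checked for every k at once (small tolerance for round-off).
weyl_holds = bool(np.all(lam_A + lam_B[-1] <= lam_AB + 1e-12)
                  and np.all(lam_AB <= lam_A + lam_B[0] + 1e-12))
```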
Suppose that adjacency matrices and . Obviously, the matrix is semi-positive definite. From Ref. 27 (page 131), we know that if λ > 0 is an eigenvalue of , then are two eigenvalues of B. So, by using Lemma 1, we have the following result.
Lemma 2. If γ ∈ [0, + ∞), the largest eigenvalue of A + γB is bounded by
Lemma 3 (Perron-Frobenius Theorem27). An irreducible nonnegative n × n matrix A always has a real, positive eigenvalue λ1(A), and the modulus of any other eigenvalue does not exceed λ1(A). Moreover, λ1(A) is a simple zero of the characteristic polynomial det(A – λIn). The eigenvector belonging to λ1(A) has positive components.
Now, we introduce the traditional SIS Markovian epidemic network. In this network, each node can be in one of two distinct states at each time: susceptible (S) or infected (I). Each infected node can recover to be susceptible with probability δ in every time step. Each susceptible node has a probability β of contagion through contact with each of its infected neighbors. So, we can define an effective spreading rate β/δ. Without loss of generality, one can let δ = 1. Letting pi(t) denote the probability of individual i being infected at time t in the network, the dynamical process of epidemic spreading can be described by the following equations12,13 with continuous time:
$$\frac{dp_i(t)}{dt} = -p_i(t) + \beta\,[1 - p_i(t)] \sum_{j=1}^{n} a_{ij}\, p_j(t), \qquad i = 1, 2, \ldots, n.$$
In this case, all connections of the network are deterministic and characterized by adjacency matrix A = (aij)n×n. By letting p(t) = (p1(t), p2(t),…, pn(t)), the Jacobian matrix at zero solution p(t) = 0 is given by −In + βA. By the asymptotic stability condition,2–4 the zero solution is asymptotically stable if λ1(−In + βA) < 0, which leads to the epidemic threshold βc = 1/λ1(A). If β is below βc, the infection will gradually die out, while if β is above βc, the infection spreads and becomes endemic.
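The threshold βc = 1/λ1(A) can be illustrated with a short simulation. This is a minimal sketch that assumes the mean-field SIS form dpi/dt = −pi + β(1 − pi)Σj aij pj with δ = 1, which is consistent with the Jacobian −In + βA stated above. The complete graph, the step size, and the function names are illustrative choices.

```python
import numpy as np

def epidemic_threshold(A):
    """Return beta_c = 1 / lambda_1(A) for a symmetric adjacency matrix A."""
    return 1.0 / np.linalg.eigvalsh(A)[-1]

def simulate_sis(A, beta, p0=0.1, dt=0.01, steps=4000):
    """Forward-Euler integration of the assumed mean-field SIS system with delta = 1."""
    p = np.full(A.shape[0], p0, dtype=float)
    for _ in range(steps):
        p = p + dt * (-p + beta * (1.0 - p) * (A @ p))
    return p

# Complete graph on 5 nodes (illustrative): lambda_1(A) = 4, so beta_c = 0.25.
n = 5
A = np.ones((n, n)) - np.eye(n)
beta_c = epidemic_threshold(A)

p_below = simulate_sis(A, 0.5 * beta_c)   # beta < beta_c: infection dies out
p_above = simulate_sis(A, 2.0 * beta_c)   # beta > beta_c: infection becomes endemic
```

Below the threshold the infection probabilities decay towards zero, while above it they settle at a positive endemic level (0.5 for this particular graph and β = 2βc).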
In the following sections, we will consider the case where only some of the connections of the network are deterministic and the remainder is stochastic. Based on several mathematical models of the epidemic network, we focus on the study of estimating their epidemic thresholds by utilizing only the deterministic connection information.
III. UNIFORM STOCHASTIC CONNECTIONS
The whole network can be divided into two unattached sub-networks: deterministic network G and stochastic network Gc. Fig. 1 presents a schematic diagram of an epidemic network with size n = 6. Every pair of nodes (i, j) ∈ G has a deterministic connection; the topological structure of G is characterized by an adjacency matrix A = (aij)n×n, whose elements are either one or zero depending on whether there is a deterministic connection between nodes i and j. Every remaining pair of nodes belongs to the stochastic network Gc, which is the complement of G. For each pair of nodes (i, j) ∈ Gc, the connection probability between them is α, i.e., the stochastic connections in Gc are uniform.
Example of an epidemic network with two unattached sub-networks: deterministic network G and stochastic network Gc.
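According to the above connection mechanism, the dynamical process of epidemic spreading can be described by the following equations:
$$\frac{dp_i(t)}{dt} = -p_i(t) + \beta\,[1 - p_i(t)] \left[ \sum_{j=1}^{n} a_{ij}\, p_j(t) + \alpha \sum_{j=1}^{n} a^c_{ij}\, p_j(t) \right], \qquad i = 1, 2, \ldots, n. \tag{3}$$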
Epidemic threshold and coupling matrix. It is easy to get the Jacobian matrix at the zero solution of network (3) as −In + βW, where W = A + αAc. Repeating the analysis in Sec. II with A replaced by W, we know βc = 1/λ1(W). For convenience, we call W the coupling matrix in this paper. In fact, the coupling matrix W is a generalized form of the adjacency matrix A in Sec. II. In order to estimate this epidemic threshold, we seek upper and lower bounds of λ1(W) that use only the adjacency matrix A and the stochastic connection probability α.
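Theorem 1. Suppose x = (x1, x2,…, xn)T ∈ Rn, Ax = λ1(A)x, and xTx = 1. Then, the epidemic threshold of network (3) satisfies
$$\frac{1}{(1-\alpha)\lambda_1(A) + \alpha(n-1)} \le \beta_c \le \frac{1}{(1-\alpha)\lambda_1(A) + \alpha\left[\left(\sum_{i=1}^{n} x_i\right)^2 - 1\right]}.$$
Proof. Since W = A + αAc = (1 − α)A + α(Jn − In), for every y ∈ Rn, yTy = 1,
$$y^T W y = (1-\alpha)\, y^T A y + \alpha\left[\left(\sum_{i=1}^{n} y_i\right)^2 - 1\right] \le (1-\alpha)\lambda_1(A) + \alpha(n-1),$$
we have
$$\lambda_1(W) \le (1-\alpha)\lambda_1(A) + \alpha(n-1). \tag{4}$$
In addition, with Ax = λ1(A)x, we get
$$\lambda_1(W) \ge x^T W x = (1-\alpha)\lambda_1(A) + \alpha\left[\left(\sum_{i=1}^{n} x_i\right)^2 - 1\right]. \tag{5}$$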
By noting that βc = 1/λ1(W), we can obtain the inequalities in this theorem. ◻
From Theorem 1, we can see that the upper and lower bounds of epidemic threshold βc only depend on the topological structure of graph G and connection probability α.
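The following sketch shows how bounds of this kind can be evaluated from A and α alone. It assumes W = A + αAc = (1 − α)A + α(Jn − In) with 0 ≤ α ≤ 1 and uses two standard facts: the Rayleigh quotient xTWx of the principal eigenvector x of A is a lower bound on λ1(W), and Lemma 1 together with λ1(Jn − In) = n − 1 gives an upper bound. These expressions are one consistent way to obtain such bounds and are not claimed to be the exact form used in Theorem 1; the star graph and function names are hypothetical.

```python
import numpy as np

def coupling_matrix(A, alpha):
    """W = A + alpha * Ac with Ac = Jn - In - A."""
    n = A.shape[0]
    Ac = np.ones((n, n)) - np.eye(n) - A
    return A + alpha * Ac

def threshold_bounds(A, alpha):
    """Lower and upper bounds on beta_c = 1/lambda_1(W) that use only A and alpha."""
    n = A.shape[0]
    eigenvalues, eigenvectors = np.linalg.eigh(A)
    lam1, x = eigenvalues[-1], eigenvectors[:, -1]   # unit-norm principal eigenvector
    # Rayleigh quotient x^T W x <= lambda_1(W), written without forming W.
    lam_low = (1.0 - alpha) * lam1 + alpha * (x.sum() ** 2 - 1.0)
    # Weyl-type bound: lambda_1(W) <= (1 - alpha) lambda_1(A) + alpha (n - 1).
    lam_high = (1.0 - alpha) * lam1 + alpha * (n - 1)
    return 1.0 / lam_high, 1.0 / lam_low

# Illustrative deterministic network: a star graph with one hub and seven leaves.
n, alpha = 8, 0.05
A = np.zeros((n, n))
A[0, 1:] = A[1:, 0] = 1.0

beta_c = 1.0 / np.linalg.eigvalsh(coupling_matrix(A, alpha))[-1]
beta_low, beta_high = threshold_bounds(A, alpha)
```

Only `beta_c` requires the full coupling matrix; `threshold_bounds` never looks at the stochastic links themselves, only at their probability α.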
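Corollary 1. If $\sum_{j=1}^{n} a_{ij} = k$ for all i = 1,2,…,n, then the epidemic threshold of network (3) is given by βc = [k + α(n – k – 1)]−1.
Proof. If $\sum_{j=1}^{n} a_{ij} = k$ for all i = 1,2,…,n, we know that $x = \frac{1}{\sqrt{n}}(1, 1, \ldots, 1)^T$ satisfies Ax = λ1(A)x = kx. From (5), we get λ1(W) ≥ k + α(n – k – 1). By combining Eqs. (4) and (5), we have λ1(W) = k + α(n – k – 1), which leads to βc = [k + α(n – k – 1)]−1. ◻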
For example, the NW small-world network with size n is generated with probability α for adding long-range connections, where each node is symmetrically connected with its k nearest neighbors in its initial nearest-neighbor network G. Obviously, $\sum_{j=1}^{n} a_{ij} = k$ for all i = 1,2,…,n. So, when we consider epidemic dynamics on this network, from Corollary 1, its epidemic threshold is [k + α(n – k – 1)]−1, where k + α(n – k – 1) is the expected average degree of the whole network (deterministic plus stochastic connections). This result is consistent with the theoretical threshold in homogeneous epidemic networks.28
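The closed-form threshold for a regular deterministic network can be compared with a direct eigenvalue computation. The sketch below uses a ring lattice in which every node has k deterministic neighbors; the values n = 20, k = 4, α = 0.05 and the helper `ring_lattice` are illustrative assumptions.

```python
import numpy as np

def ring_lattice(n, k):
    """Adjacency matrix of a ring where each node links to its k nearest neighbours (k even, k < n)."""
    A = np.zeros((n, n))
    for i in range(n):
        for step in range(1, k // 2 + 1):
            j = (i + step) % n
            A[i, j] = A[j, i] = 1.0
    return A

n, k, alpha = 20, 4, 0.05
A = ring_lattice(n, k)
Ac = np.ones((n, n)) - np.eye(n) - A
W = A + alpha * Ac                      # coupling matrix with uniform stochastic links

beta_c_numeric = 1.0 / np.linalg.eigvalsh(W)[-1]
beta_c_formula = 1.0 / (k + alpha * (n - k - 1))
```

Because every row of W sums to k + α(n – k – 1), the all-one vector is the principal eigenvector and the two values coincide up to round-off.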
IV. NONUNIFORM STOCHASTIC CONNECTIONS
In general, due to the individual diversity, different nodes have different connection probabilities when they contact their neighboring stochastic nodes. That is to say, the spreading network generally includes nonuniform stochastic connections. To realize this connection mechanism, for (i, j) ∈ Gc, let dij be the probability with which the node i connects its stochastic neighbor node j. This means that if (i, j) ∈ Gc, then there is a connection between them with probability dijdji. Certainly, if the stochastic transmission occurs only on some of the connections of Gc, then the corresponding dij = 0. In particular, in the case of uniform stochastic connections, we have dijdji = α for all (i, j) ∈ Gc.
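According to the above connection mechanism, the dynamical process of epidemic spreading can be described as
$$\frac{dp_i(t)}{dt} = -p_i(t) + \beta\,[1 - p_i(t)] \left[ \sum_{j=1}^{n} a_{ij}\, p_j(t) + \sum_{j=1}^{n} a^c_{ij}\, d_{ij} d_{ji}\, p_j(t) \right], \qquad i = 1, 2, \ldots, n. \tag{6}$$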
The coupling matrix of network (6) can be written as
where
and 0 – 1 symmetrical matrix
with only two displayed nonzero elements. It is easy to see that . Let ○Gc) be the number of connections in Gc. Obviously, we have . Then, we attain the following theorem.
Theorem 2. Suppose x = (x1, x2,…, xn)T ∈ Rn, Ax = λ1(A)x, and xTx = 1. Then, the epidemic threshold of network (6) satisfies
Proof. On one hand, for every y ∈ Rn, yTy = 1, since for all (i, j) ∈ Gc, we get
which leads to
On the other hand, if Ax = λ1(A) x and xTx = 1, we have
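As a special case, if dij = di for (i, j) ∈ Gc, then the dynamical process of epidemic spreading can be described as
$$\frac{dp_i(t)}{dt} = -p_i(t) + \beta\,[1 - p_i(t)] \left[ \sum_{j=1}^{n} a_{ij}\, p_j(t) + d_i \sum_{j=1}^{n} a^c_{ij}\, d_j\, p_j(t) \right], \qquad i = 1, 2, \ldots, n. \tag{10}$$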
The coupling matrix of network (10) is W = A + DAcD, where D = diag{d1, d2,…, dn}. Then, we obtain the following result.
Theorem 3. Suppose x = (x1, x2,…, xn)T ∈ Rn, Ax = λ1(A)x, and xTx = 1. Then, the epidemic threshold of network (10) satisfies
Proof. For every y ∈ Rn, yTy = 1, since
we obtain
As , we have , which leads to
Similarly, we obtain
Obviously,
By integrating Eqs. (12)–(15), we conclude that
From Eq. (11), if Ax = λ1(A)x, we have
Therefore, by noting that βc = 1/λ1(W), we can obtain the result of this theorem. ◻
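The coupling matrices of this section can be assembled as follows. This is a minimal sketch assuming that a stochastic pair (i, j) ∈ Gc contributes the weight dijdji, so that the special case dij = di gives W = A + DAcD. The function names, the path graph, and the sampled probabilities are hypothetical.

```python
import numpy as np

def coupling_matrix_pairwise(A, Dij):
    """W_ij = a_ij + ac_ij * d_ij * d_ji for a general matrix of probabilities d_ij."""
    n = A.shape[0]
    Ac = np.ones((n, n)) - np.eye(n) - A
    return A + Ac * (Dij * Dij.T)          # element-wise products

def coupling_matrix_nodewise(A, d):
    """W = A + D Ac D with D = diag(d), i.e., the special case d_ij = d_i."""
    n = A.shape[0]
    Ac = np.ones((n, n)) - np.eye(n) - A
    D = np.diag(d)
    return A + D @ Ac @ D

# Illustrative deterministic network: a path on 5 nodes.
n = 5
A = np.zeros((n, n))
for i in range(n - 1):
    A[i, i + 1] = A[i + 1, i] = 1.0

rng = np.random.default_rng(1)
d = rng.uniform(0.0, 0.5, size=n)          # node-wise probabilities d_i in [0, 0.5]

W_node = coupling_matrix_nodewise(A, d)
W_pair = coupling_matrix_pairwise(A, np.tile(d[:, None], (1, n)))   # rows hold d_i
beta_c = 1.0 / np.linalg.eigvalsh(W_node)[-1]

# Uniform case: d_i = sqrt(alpha) for all i recovers W = A + alpha * Ac.
alpha = 0.04
W_uniform = coupling_matrix_nodewise(A, np.full(n, np.sqrt(alpha)))
```

Since the stochastic part only adds nonnegative weights, the resulting threshold cannot exceed the purely deterministic value 1/λ1(A).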
Now, we give an application of Theorem 3 for an epidemic network with awareness. Suppose that all nodes in the network have individual protection awareness which is adjusted instantaneously by the infection density of their neighboring deterministic nodes. We find that the individual protection awareness will not change the epidemic threshold. For example, we consider the local protection awareness by letting connection probability
With time-varying , network (10) can be rewritten as
where i = 1,2,…,n. It is obvious that the coupling matrix of network (19) is still W = A + DAcD. Thus, we have the following result.
V. UNIFORM STOCHASTIC CONNECTIONS AND COMMUNITY STRUCTURE
In this section, we consider the case where the deterministic network G has community structure. Without loss of generality, we suppose that G has two communities with sizes m and n, respectively. The inner connections within the two communities are characterized by the block-diagonal adjacency matrix diag{A1, A2}, where A1 ∈ Rm×m and A2 ∈ Rn×n. The outer connections between the two communities are characterized by the off-diagonal blocks B and BT, where B ∈ Rm×n. Then the adjacency matrix of the deterministic network is the block matrix A = [A1, B; BT, A2], where A1 and A2 are symmetric, and B is generally asymmetric.
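A two-community deterministic network of this form can be assembled with a block matrix. The sketch below assumes the block layout A = [A1, B; BT, A2] and, for the stochastic part, a uniform probability α on every pair outside G. Ring lattices stand in for the communities, and the sizes, the inter-community link probability p, and the helper name are illustrative assumptions.

```python
import numpy as np

def ring_lattice(n, k):
    """Adjacency matrix of a ring where each node links to its k nearest neighbours (k even, k < n)."""
    A = np.zeros((n, n))
    for i in range(n):
        for step in range(1, k // 2 + 1):
            j = (i + step) % n
            A[i, j] = A[j, i] = 1.0
    return A

m, n, p, alpha = 12, 8, 0.1, 0.01
rng = np.random.default_rng(2)

A1 = ring_lattice(m, 4)                          # inner links of community 1 (m x m)
A2 = ring_lattice(n, 4)                          # inner links of community 2 (n x n)
B = (rng.random((m, n)) < p).astype(float)       # outer links (m x n), generally asymmetric

A = np.block([[A1, B], [B.T, A2]])               # symmetric adjacency matrix of G
N = m + n
W = A + alpha * (np.ones((N, N)) - np.eye(N) - A)   # assumed coupling matrix

lam1_A = np.linalg.eigvalsh(A)[-1]
beta_c = 1.0 / np.linalg.eigvalsh(W)[-1]
```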
The dynamical process of epidemic spreading can be described by the following equations:
Define , where is the all one matrix. It is easy to verify that . The coupling matrix of network (20) is
Theorem 4. If Az = λ1(A)z and zTz = 1, the epidemic threshold of network (20) satisfies
Proof. From (4) and Lemma 2, we get
By applying Lemma 2, we have
which results in
So, by noting that βc = 1/λ1(W), we can obtain the result of this theorem. ◻
Suppose x ∈ Rm, y ∈ Rn, A1x = λ1(A1)x, A2y = λ1(A2)y, and xTx = yTy = 1. Let . Let , , and . Now, to improve the estimation power, we present optimized upper bound for βc by using the Lagrange multipliers method.
Corollary 3. The optimal upper bound for βc of network (20) is given by .
Proof. First, from Perron-Frobenius Theorem (see Lemma 3), we know that x > 0, y > 0, which leads to μ1 > 0, μ2 > 0, μ3 > 0, and μ4 > 0. Let z = (axT, byT)T with a2 + b2 = 1, which means zTz = 1. From (21), we have
Let . We need to solve the following optimization problem:
If (a*, b*) is the optimal solution of (25), then is the optimal lower bound for λ1(W), and is the optimal upper bound for βc.
By the Lagrange multipliers method, we define the Lagrange function as
where θ is the Lagrange multiplier. From the optimization condition , we obtain
By adding the above first equation to the second equation, we have . Since (a, b) ≠ (0, 0), we require that
From above equation, we obtain
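The constrained optimization over (a, b) can also be carried out numerically. The sketch below assumes that the objective is the quadratic form zTWz with z = (axT, byT)T and W = A + αAc; in that case zTWz = (a, b)M(a, b)T for a 2 × 2 symmetric matrix M, and the Lagrange stationarity conditions reduce to an eigenvalue problem for M. It is a numerical counterpart under these assumptions and does not reproduce the closed-form expression of Corollary 3. The network sizes and helper names are hypothetical.

```python
import numpy as np

def ring_lattice(n, k):
    """Adjacency matrix of a ring where each node links to its k nearest neighbours (k even, k < n)."""
    A = np.zeros((n, n))
    for i in range(n):
        for step in range(1, k // 2 + 1):
            j = (i + step) % n
            A[i, j] = A[j, i] = 1.0
    return A

m, n, p, alpha = 12, 8, 0.1, 0.01
rng = np.random.default_rng(3)
A1, A2 = ring_lattice(m, 4), ring_lattice(n, 2)
B = (rng.random((m, n)) < p).astype(float)
A = np.block([[A1, B], [B.T, A2]])
N = m + n
W = A + alpha * (np.ones((N, N)) - np.eye(N) - A)

# Unit-norm principal eigenvectors of the two communities; abs() selects the
# positive orientation guaranteed by the Perron-Frobenius theorem.
x = np.abs(np.linalg.eigh(A1)[1][:, -1])
y = np.abs(np.linalg.eigh(A2)[1][:, -1])

# With z = (a x, b y) and a^2 + b^2 = 1, z^T W z = [a, b] M [a, b]^T.
M = np.array([[x @ W[:m, :m] @ x, x @ W[:m, m:] @ y],
              [y @ W[m:, :m] @ x, y @ W[m:, m:] @ y]])
theta, ab = np.linalg.eigh(M)           # stationarity: M (a, b)^T = theta (a, b)^T
a_opt, b_opt = ab[:, -1]
lam_lower_opt = theta[-1]               # best lower bound on lambda_1(W) in this family
z = np.concatenate([a_opt * x, b_opt * y])

beta_c = 1.0 / np.linalg.eigvalsh(W)[-1]
beta_upper_opt = 1.0 / lam_lower_opt
beta_upper_single = 1.0 / max(M[0, 0], M[1, 1])   # using only one community (a = 1 or b = 1)
```

Optimizing over (a, b) can only tighten the upper bound on βc relative to using a single community's eigenvector, which is the motivation for the optimized bound.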
VI. NUMERICAL SIMULATIONS
In this section, we present some numerical examples to show the effectiveness of epidemic threshold estimations in Secs. III–V.
First, we consider the case of an epidemic network with uniform stochastic connections. As representative examples, the topological structure of the deterministic network G is taken to be a WS small-world network29 or a BA network.30 The WS network is generated with probability 0.1 for rewiring links, where each node is symmetrically connected with its six nearest neighbors in its initial nearest-neighbor network. The BA network is produced with four initial nodes, which are fully connected, and then adding a new node with three new edges at each time step. The epidemic threshold is computed by βc = 1/λ1(W). The upper bound and lower bound are given by Theorem 1. Fig. 2 compares the epidemic threshold with the upper and lower bound estimates for different network sizes n and uniform stochastic probabilities α. From this figure, we can see that the epidemic threshold always lies between the upper bound and the lower bound.
Comparisons between the epidemic threshold βc and upper-lower bound estimation under different network sizes n and uniform stochastic probability α = 0.01 in (a), (c), and α = 0.001 in (b), (d).
Next, we consider the case of an epidemic network with nonuniform stochastic connections. Suppose that the di (i = 1,2,…,n) are uniformly distributed within [0, η] with 0 < η ≤ 1. Fig. 3 compares the epidemic threshold with the upper and lower bound estimates for different network sizes n and parameters η. By decreasing the parameter η, we can statistically reduce the number of stochastic connections. This figure verifies the upper and lower bound estimates in Theorem 3 very well. Combining Figs. 2 and 3, it can be concluded that the smaller the stochastic connection probability is, the better the estimation will be. In order to explore the influence of the distribution, we further suppose that di is generated from a normal distribution with mean μ and variance σ2. The result is presented in Fig. 4, in which the upper and lower bound estimates in Theorem 3 are still valid.
Comparisons between the epidemic threshold βc and upper-lower bound estimation under different network sizes n and parameter η = 0.5 in (a), (c), and η = 0.2 in (b), (d).
Comparisons between the epidemic threshold βc and upper-lower bound estimation under different network sizes n and parameters μ = 0.1, σ2 = 0.001 in (a), (c), and μ = 0.2, σ2 = 0.001 in (b), (d).
Finally, we take into account the community structure within an epidemic network with uniform stochastic connections. Suppose that the deterministic network G has two communities with sizes m and n, which both have WS small-world network structure. For every pair of nodes from different communities, an outer connection is randomly generated with probability p. In this particular example, we choose p = 0.01 and a uniform stochastic probability α = 0.01. For different community sizes m and n, Fig. 5 compares the epidemic threshold with the upper and lower bound estimates in Theorem 4. In this figure, there exists a large gap between the epidemic threshold and the upper bound. In order to decrease this gap, we can utilize the optimal upper bound estimation in Corollary 3. For this purpose, we give a realization in Fig. 6, which shows a smaller gap than Fig. 5.
Comparisons between the epidemic threshold βc and upper-lower bound estimation under different community sizes m and n.
Comparisons between the epidemic threshold βc, upper bound, and optimal upper bound estimation under different community sizes m and n.
VII. CONCLUSIONS
In this paper, we have focused on the estimation of the epidemic threshold on networks with deterministic and stochastic connections. First, we have constructed several epidemic models with some general properties, including nonuniform stochastic connections, local protection awareness of individuals, and community structure. Second, by using spectral analysis on these networks, we have obtained some inequality estimates of their epidemic thresholds. The results show that these inequalities depend only on the topological structure of the deterministic connections and the stochastic connection probabilities. In other words, one can use the information from the deterministic connections, rather than from all connections, to estimate the epidemic threshold. This work provides a feasible method to estimate the epidemic thresholds in real epidemic networks, where a complete description of the stochastic nature of the epidemic may be difficult to obtain.
To further understand the epidemic dynamics in real complex networks, there are, of course, topics which need to be resolved in the future. These include networks with both nonuniform stochastic connections and community structure, and networks with multi-community structure, among many others. Another important problem is to develop more effective methods to improve the estimation power.
ACKNOWLEDGMENTS
This research was supported by NSFC Grant (Nos. 61004101, 11161013, 61164020, and 11331009). The authors gratefully acknowledge the support of the Guangxi Natural Science Foundation Program (Nos. 2014GXNSFBA118007, 2011GXNSFB018059, and 2013GXNSFAA019006). M.S. is currently funded by the Australian Research Council via a Future Fellowship (No. FT110100896) and Discovery Project (No. DP140100203). And this work was supported by Guangxi Key Laboratory of Cryptography and Information Security, Guilin University of Electronic Technology. We also thank Haifeng Zhang for useful discussions.