- Research
- Open Access
Analysis of gene network robustness based on saturated fixed point attractors
- Genyuan Li^{1} and
- Herschel Rabitz^{1}Email author
https://doi.org/10.1186/1687-4153-2014-4
© Li and Rabitz; licensee Springer. 2014
- Received: 5 July 2013
- Accepted: 5 March 2014
- Published: 20 March 2014
Abstract
The analysis of gene network robustness to noise and mutation is important for fundamental and practical reasons. Robustness refers to the stability of the equilibrium expression state of a gene network to variations of the initial expression state and network topology. Numerical simulation of these variations is commonly used for the assessment of robustness. Since there exists a great number of possible gene network topologies and initial states, even millions of simulations may be still too small to give reliable results. When the initial and equilibrium expression states are restricted to being saturated (i.e., their elements can only take values 1 or −1 corresponding to maximum activation and maximum repression of genes), an analytical gene network robustness assessment is possible. We present this analytical treatment based on determination of the saturated fixed point attractors for sigmoidal function models. The analysis can determine (a) for a given network, which and how many saturated equilibrium states exist and which and how many saturated initial states converge to each of these saturated equilibrium states and (b) for a given saturated equilibrium state or a given pair of saturated equilibrium and initial states, which and how many gene networks, referred to as viable, share this saturated equilibrium state or the pair of saturated equilibrium and initial states. We also show that the viable networks sharing a given saturated equilibrium state must follow certain patterns. These capabilities of the analytical treatment make it possible to properly define and accurately determine robustness to noise and mutation for gene networks. Previous network research conclusions drawn from performing millions of simulations follow directly from the results of our analytical treatment. Furthermore, the analytical results provide criteria for the identification of model validity and suggest modified models of gene network dynamics. The yeast cell-cycle network is used as an illustration of the practical application of this analytical treatment.
Keywords
- Robustness of gene networks
- Fixed point attractor
- Sigmoidal function
- Yeast cell-cycle network
1 Introduction
A subset of genes in a cell whose protein products mutually regulate one another’s expression at the transcriptional level will be referred to as a ‘gene network’ in this paper. The concentration of proteins encoded by the genes changes in time due to auto- and cross-regulation by the gene products. Each of such network is considered as a dynamical system. Gene networks must be robust with respect to ever-changing environments. Robustness here refers to the ability of a gene network to respond to short-term changes in the environment and quickly return to its functional steady state. Moreover, a gene network itself may endure small structural changes and mutations, while still retaining its desired steady state. The robustness of gene networks depends on their topology with some networks being more stable than others. The analysis of the relationship between the topology of a gene network and its robustness to environmental and structural perturbations is important both theoretically and practically [1–10].
with S_{ i }∈[−1,1], and S_{ i }=0 corresponding to 50% of gene i being ‘on’. Notwithstanding the simplicity of (3), variants of this model have been successfully used to study (a) the robustness of gene regulatory networks [12, 16, 17], (b) the role of robustness in evolutionary innovation [18, 19], and (c) how recombination can produce negative epistasis [20].
which is similar to (3) with the additional parameters λ_{ a } and h^{ a }. This model has been successfully applied to treat the blastoderm of Drosophila melanogaster[21].
to describe the dynamics of a genetic regulatory network for Arabidopsis thaliana flower morphogenesis. This model is also similar to (3) except that the sigmoidal function σ is replaced by the Heaviside step function and a threshold parameter θ_{ i } is included.
All these models present simplified descriptions of gene network dynamics. Nevertheless, the models are still useful for obtaining insights into the dynamics of gene networks. In the following analysis for gene network robustness, we will employ the sigmoidal function model in (3), and its modification with threshold parameters.
The robustness of a gene network specified by W to noise (environmental) and mutation (structural) perturbations may be expressed as the stability of the final equilibrium (or steady) expression state S(∞) obtained from the solution of (3), respectively, to changes of initial expression state S(0) and to changes of W. A complete and reliable robustness analysis would seem to call for an exhaustive sampling over the space of possible initial states for a given network and then repeating the same simulations for all possible networks. This task is infeasible as there are many possible networks W, and each W may have many initial/final equilibrium expression states. Consider a simple case where S_{ i }(0) and S_{ i }(∞)(i=1,2,…,n) can only take values −1,1 and w_{ i j } can only take values −1, 0, 1. In this case, there are 2^{ n } possible initial/final states and ${3}^{{n}^{2}}$ possible gene networks. For a modest network of size n=20, there are 2^{20}=1,048,576 initial/final expression states and 3^{400}≈7×10^{190} gene networks. Even if one arbitrarily makes the restriction that 75% of the w_{ i j } interactions are zero, and that the remaining 25% of the w_{ i j } interactions can only take nonzero values −1,1 to reduce the possible number of networks [13], for n=20 there are still ${C}_{400}^{100}\times {2}^{100}$ possible gene networks. Further restriction may also be applied to reduce the possible number of initial expression states [13]. Even under these restrictions, solving (3) for all possible initial expression states and gene networks is still infeasible, and only a small fraction of them can be randomly sampled and simulated. Such limitations leave open the reliability of the conclusions obtained from the simulations.
Previous work [13] concerned networks with connectivity W whose expression dynamics start from a prespecified initial state S(0) at some time t=0 and arrive at a prespecified stable equilibrium or ‘target’ expression state S(∞); these networks are referred to as ‘viable’. Then, the values of some elements of S(0) or W were changed for each viable network to check whether S(∞) is reached. These studies entailed performing millions of simulations with different network topologies and initial expression states. Although the number of simulations is much smaller than 2^{ n } and ${3}^{{n}^{2}}$ for n=20, some specific conclusions were obtained. First, the fraction of viable networks, that is, networks that arrive at a prespecified target expression state S(∞) given an initial gene expression state S(0) to the total number of possible networks, is generally very small. For moderately sized networks of n=20 genes (with the number M of nonzero w_{ i j } set to be 200, and the fraction d of elements different between S(0) and S(∞) set to 0.5), the fraction of viable networks was found to be v_{ f }=5.1×10^{−9}±1.7×10^{−10}. Due to the large numbers 2^{ n } and ${3}^{{n}^{2}}$, the qualitative correctness of this conclusion is clear. Since there are 2^{ n } possible equilibrium states, even if we only consider the factor of expression states, the probability that a network W arrives at a prespecified S(∞) is expected on the order of 1/2^{ n }. The viable networks in this prior work could be organized as a graph with each node corresponding to a network of a given topology, and two nodes are connected by an edge if they differ by a single regulatory interaction (i.e., they differ in one element w_{ i j }). Remarkably, this graph is connected and can be easily traversed by gradual changes of the network topology. Thus, highly robust topologies can evolve from topologies with low robustness through gradual Darwinian topological changes. These results are claimed to be valid for discrete and continuous w_{ i j } taking values [−1,0,1] and over the interval [−a,a], respectively. While simulations are valuable, they do not provide a complete picture.
Ciliberti et al. [13] considered the case where each element of the initial and equilibrium expression states, S(0) and S(∞), can only take the values 1 or −1 corresponding to maximum and minimum possible protein concentrations. We call them saturated expression states. Under this condition, the present paper provides an analytical robustness assessment of gene networks whose dynamics can be described by (3) and its modification with threshold parameters. This analysis can determine (a) for a given network, which and how many saturated equilibrium states exist, and which and how many saturated initial states converge to each of these saturated equilibrium states; (b) for a given saturated equilibrium state, or a given pair of saturated equilibrium and initial states, which and how many gene networks, referred to as viable, share this saturated equilibrium state or the pair of saturated equilibrium and initial states. We also show that the viable networks sharing a given saturated equilibrium state must follow certain patterns. These capabilities of the analytical treatment make it possible to properly define and accurately determine robustness to noise and mutation for gene networks. Previous network research conclusions drawn from performing millions of simulations follow directly from the results of our analytical treatment. Furthermore, the analytical results provide criteria for identification of model validity and suggest modified models of gene network dynamics.
The paper is organized as follows: Section 2 first defines the saturated state and saturated fixed point attractor for dynamics (3), and then gives the necessary and sufficient condition for a gene network to have a given saturated equilibrium state. Sections 3 and 4 analyze the robustness to noise and mutation, respectively. Section 5 proposes a modification of dynamics (3) with threshold parameters. Section 6 gives an illustration of the practical application of this analytical treatment: the model construction of the yeast cell-cycle network and its robustness assessment. The details of the treatment of the yeast cell-cycle network are given in an Additional file 1: Supplementary information. Finally, Section 7 presents conclusions. Mathematical proofs of the theorems in the main text are given in the Appendix.
2 Saturated states and fixed point attractors
In this work, the initial and equilibrium expression states S_{ i }(0) and S_{ i }(∞) for gene i can only be either active (S_{ i }(0) and S_{ i }(∞)=1) or inactive (S_{ i }(0) and S_{ i }(∞)=−1) [11, 12, 14]. The initial and equilibrium expression states with S_{ i }(0) and S_{ i }(∞)=±1 are referred to as saturated initial and equilibrium expression states. Under the condition that the initial and equilibrium expression states S(0)and S(∞)are saturated, we may analyze the robustness and evolvability of gene networks analytically, as explained below.
2.1 Saturated sigmoidal function
A continuous function f(x) defined on ℜ satisfying (f1) f(x)=1 if x≥1, f(x)=−1 if x≤−1, (f2) f(x) is a strictly increasing and continuous function for x∈[−1,1] and f(0)=0,
is called a saturated sigmoidal function. Furthermore, if (f3) f(x)≥x for x∈(0,1] and f(x)≤x for x∈[−1,0),
we call f(x)] a dissipative saturated sigmoidal function.
2.2 Necessary and sufficient condition for a saturated state S to be an equilibrium state or a fixed point attractor of dynamics (3) with a given W
When the equilibrium states S(∞) are saturated, the analysis of robustness and evolvability for gene networks can be readily performed by utilizing Feng and Tirozzi’s treatment for neural networks [23].
for S(0)∈B(S) and $\sum _{j=1}^{n}{w}_{\mathit{\text{ij}}}{S}_{j}\ne 0$, for all i.
for any i, then −0.99<S_{ i }(t+τ)<0.99 for any t, and the gene network cannot have a saturated fixed point.
which denote, respectively, the two sets of all integers in {1,2,…,n} with S_{ i }=1 and S_{ i }=−1.
Proof. If a saturated state S is a fixed point of (3), it must satisfy (9,10), i.e., (13,14). If (13,14) are satisfied, so are (9,10), then (11) is satisfied and S is a fixed point. A saturated state S satisfying (13,14) (or equivalently (15)) is a fixed point attractor. The proof is given in Theorem A2 in the Appendix. □
There are a total 2^{ n } saturated states S. Since (15) only involves simple multiplication and summation, for a modest n one can test all 2^{ n } saturated vectors S for a given W and find all of its saturated fixed point attractors without iteratively solving the sigmoidal function (3). For n=11 and 2^{11}=2,048, the test takes 0.01 s by Matlab on a Dell Precision Workstation T3400.
There are 2^{4}=16 saturated states. Similar tests for the other 15 saturated states were performed. The case S_{2}=−S_{1}=(1−11−1) is the only other saturated equilibrium state for the network.
3 Robustness to noise
Robustness to noise may be assessed for (a) each saturated equilibrium expression state or (b) a specified pair of saturated equilibrium and initial expression states. In case 1, we need to compare how many of 2^{ n } possible saturated initial expression states converge to each saturated equilibrium expression state; in case 2, we need to determine how many neighbours (differing only in one element) of the S(0) converge to the same saturated equilibrium expression state for a given W[13]. In either case, we need to establish the condition under which a saturated initial expression state converges to a given saturated equilibrium expression state of W.
3.1 Relationship between saturated initial expression states and saturated equilibrium expression states
Theorem 2. If S is a saturated equilibrium expression state (or a fixed point attractor) of dynamics (3) with a given W, so is −S.
which proves that −S is also a saturated fixed point attractor, i.e., a saturated equilibrium expression state for W. Example 1 demonstrates its validity. □
Corollary 1. Dynamics (3) either does not have a saturated equilibrium expression state, or has an even number of saturated equilibrium expression states.
This result can be obtained immediately from Theorem 2.
satisfies Theorem 2 and Corollary 1.
Theorem 3. If S(t) converges to a saturated equilibrium expression state S, then −S(t) converges to −S.
Proof. See Theorem A3 in the Appendix. □
Corollary 2. In the hypercube [−1,1]^{ n }, the volume of the region where points converge to a saturated equilibrium state S is equal to the volume of the region where points converge to −S.
Proof. Considering that S(t) and −S(t) are symmetric and have the same distance to the origin (see Figure 3), then in [−1,1]^{ n } (which is symmetric to the origin) the volume of the region where points converge to S is equal to the volume of the region where points converge to −S. However, if S_{1} and S_{2} are two saturated fixed point attractors, but S_{2}≠−S_{1}, for a given W, generally in [−1,1]^{ n } the volume of the region where points converge to S_{1} may not be equal to the volume of the region where points converge to S_{2}. □
Theorem 4.S=0 is a fixed point, but may not be a fixed point attractor for a W having a saturated fixed point attractor S.
Therefore, 0 may not be a fixed point attractor, but an unstable fixed point. For an unstable fixed point, there exist divergent or both convergent and divergent neighborhoods of 0. In the convergent region, trajectories will be attracted to 0; In the divergent region, trajectories will leave from the neighbourhood of 0. Therefore, it is possible upon starting from a saturated initial expression state S(0) that the solution trajectory arising from the sigmoidal function (3) may converge to 0, or first approach 0, but then enter the divergent region, and move away from that neighborhood. □
under the constraint that α_{ i }≥2.
Proof. See Theorem A4 in the Appendix. □
Theorem 5 provides a way to determine all possible saturated initial expression states converging to a saturated equilibrium expression state of a W. The constraint α_{ i }≥2 implies that the gene network must contain more than one gene if w_{ i j } only takes values in [ −1,0,1]. This is always true.
Number of saturated initial states converging to different final states for the gene network given in ( 17)
Final state | Number of saturated initial states |
---|---|
(1 1 1 −1 −1 −1) | 22 |
(−1 −1 −1 1 1 1) | 22 |
(0 0 0 0 0 0) | 20 |
Using the condition in (24,25), we found that each saturated equilibrium expression state for the gene network given in (17) has 22 saturated initial expression states (including the saturated equilibrium expression state itself). The remaining 20 saturated initial states, which do not satisfy (24,25), converge to 0.
For saturated initial expression states converging to one of the two saturated equilibrium expression states, the condition in (24,25) is satisfied starting from the iteration number k as either 0 or 1. For saturated initial expression states converging to 0, the condition given in (24,25) is never satisfied. Therefore, in this gene network, the sigmoidal function (3) either does not need to be solved, or only needs to be solved just once, to determine which and how many saturated initial expression states converge to a particular saturated equilibrium expression state. The computational effort will be reduced. For a network with 11 genes, using Theorem 5 has approximately 60% CPU time saving compared to completely solving (3).
3.2 Definition for robustness to noise
Robustness to noise may be defined as follows for a network W, for each of its saturated equilibrium expression states, and for a specified pair of saturated equilibrium and initial expression states, respectively.
The subscript n_{ t } denotes ‘noise’ and ‘total’.
According to the definition, larger values of m correspond to worse robustness to noise. This definition is reasonable because more fixed points a W has, then less saturated initial states converging to each of its equilibrium states. Changing an initial state has more chances to cause change of the equilibrium state.
In this definition, m should also include limit cycles. Since there is no simple way (like the Banach fixed point theorem for the existence of fixed point attractor) to determine the existence of limit cycle for sigmoidal functions, in the current work, we are unable to include it. The robustness ${R}_{{n}_{t}}$ takes on values in [0,1]. For W given in Example 2, there are three fixed points: 0, (1 1 1 − 1 −1 −1) and (−1 −1 −1 1 1 1). Therefore, ${R}_{{n}_{t}}=1/3$, which implies that there is a 2/3 probability for having a saturated equilibrium expression state change caused by a saturated initial expression state change.
The subscripts n and i denote ‘noise’ and the i th saturated equilibrium expression state. For saturated equilibrium expression states (1 1 1 −1 −1 −1) and (−1 −1 −1 1 1 1) in Example 2, ${R}_{{n}_{i}}=22/64\approx 1/3\phantom{\rule{0.3em}{0ex}}(i=1,2)$.
Ciliberti et al. [13] argued that the pair of saturated equilibrium and initial expression states, S(∞) and S(0), play a central role for a viable network, but the variation of initial expression state in realistic cases is often mild. They define one measure of robustness to noise as the probability R_{v,I} that a change in one gene’s expression state in the saturated initial expression state S(0) leaves the unchanged network’s saturated equilibrium state S(∞). Following this pattern, we also define the measure of robustness to noise for a given pair of saturated equilibrium and initial expression states with a viable network as follows.
Example 5. Determination of ${R}_{{n}_{\mathit{\text{ij}}}}$ for the network in Example 2.
Distribution of${\mathit{R}}_{{\mathit{n}}_{\mathit{ij}}}$ for the network W in Example 2
Final state | ${\mathit{R}}_{{\mathit{n}}_{\mathit{ij}}}$ | |
---|---|---|
1 | 1/3 | |
(1 1 1 -1 -1 -1) | 7 | 15 |
(-1 -1 -1 1 1 1) | 7 | 15 |
For S_{ i }, the seven S_{ j }(0)s in the 22 saturated initial states with ${R}_{{n}_{\mathit{\text{ij}}}}=1$ are S_{ i } itself and those differing from S_{ i } by only one element; the other fifteen S_{ j }(0)s in the 22 saturated initial states with ${R}_{{n}_{\mathit{\text{ij}}}}=1/3$ are those differing from S_{ i } by two elements. The S_{ j }(0) differing from S_{ i } by more than two elements does not converge to the S_{ i }.
4 Robustness to mutations
For robustness to noise, we need to find all possible saturated equilibrium expression states for a given gene network W. In contrast, for robustness to mutations, we have the opposite task: for a given saturated equilibrium expression state we need to determine which and how many W s share this saturated equilibrium expression state.
4.1 Conditions under which gene networks share the same saturated equilibrium expression state
under the condition w_{ i j }∈[−a,a].
Proof. Note that the rows of W are independent. When S is the saturated equilibrium expression state of W, the elements of each row (for example, the i th row w_{ i j }(j=1,2,…,n)) of W must satisfy either (13) or (14), i.e., (29) or (30) which is an inequality with n variables. The inequality is solvable and has an infinite number of solutions. Those are the desired solutions with each w_{ i j } located within the required range [−a,a]. For discrete w_{ i j } (only taking values [−1,0,1]), the number of solutions is finite. All the solutions can be completely counted and determined by solving (29) or (30). □
- 1.
For i∈J ^{+}(S), increasing the value of the first term of (29) (if the increase does not make the value of w _{ i j } larger than the upper bound a) will not violate the inequality, i.e., increasing the value of w _{ i j }(j∈J ^{+}(S)) will keep the same saturated equilibrium state S. This behavior implies that either increasing the activation or decreasing the repression influence of active gene j on active gene i at equilibrium state S will not change the saturated equilibrium state.
- 2.
For i∈J ^{+}(S), decreasing the value of the second term of (29) (if the decrease does not make the value of w _{ i j } smaller than the lower bound −a) will not violate the inequality, i.e., decreasing the value of w _{ i j }(j∈J ^{−}(S)) will keep the same saturated equilibrium state S. This behavior implies that either decreasing the activation or increasing the repression influence of inactive gene j on active gene i at equilibrium state S will not change the saturated equilibrium state.
- 3.
Similarly, for i∈J ^{−}(S), either increasing the activation or decreasing the repression influence of inactive gene j on inactive gene i at equilibrium state S will not change the saturated equilibrium state.
- 4.
For i∈J ^{−}(S), either decreasing the activation or increasing the repression influence of active gene j on inactive gene i at equilibrium state S will not change the saturated equilibrium state.
Here, the discrete values [−1,0,1] are required for the elements w_{ i j }. Thus, we seek to determine all gene networks W sharing the same two saturated equilibrium expression states given in Example 2 (due to Theorem 2, −S is also a saturated equilibrium expression state).
- 1.
$\sum _{j=4}^{6}{w}_{\mathit{\text{ij}}}=-3$
- 2.
$\sum _{j=4}^{6}{w}_{\mathit{\text{ij}}}=-2$
- 3.
$\sum _{j=4}^{6}{w}_{\mathit{\text{ij}}}\ge -1$
In the construction of w k+, for a given (w_{i 4},w_{i 5},w_{i 6}), all possible patterns of (w_{i 1},w_{i 2},w_{i 3}) are considered. For a given (w_{i 1},w_{i 2},w_{i 3}), a similar treatment for (w_{i 4},w_{i 5},w_{i 6}) was performed. Thus, for any w k+, we can always find a w l+ differing from it only by a single w_{ i j }. This is also true for w k−. In this example, w k+ and w k−(k=2,3,…,7) differ from w 1+ and w 1− only by a single w_{ i j }, respectively.
These gene networks sharing the same saturated equilibrium expression states are referred to as ‘viable’ networks (see [20]). This definition is different from that given by Ciliberti et al. [13], where viable networks were those sharing a prespecified pair of saturated initial and equilibrium expression states. The viable networks defined by sharing a pair of saturated initial and equilibrium expression states are a subset of the viable networks defined by sharing a saturated equilibrium expression state only.
The analysis here about viable networks implies that for a given saturated state, all viable networks having it as an equilibrium state must follow certain patterns, i.e., its rows must be chosen from finite permitted rows. The permitted rows for a given saturated equilibrium state have specific biological meaning and reflect the required connectivity patterns of each gene to other genes. This restriction distinguishes viable networks for a given equilibrium state from other viable networks with distinct equilibrium states as well as inviable networks.
4.2 Definitions of robustness to mutation
The number of viable networks in the example above 7^{6}=117,649 itself is large, but the total number of possible gene networks with n=6 is ${3}^{{6}^{2}}\approx 1.5\times 1{0}^{17}$. The fraction of viable gene networks in the total number of possible gene networks is 7^{6}/3^{36}≈7.8383×10^{−13}, even smaller than that obtained previously in numerical simulations [13] for n=20. Based on the above analysis for viable gene networks, it seems plausible to define robustness to mutation for a given saturated equilibrium state as the ratio of the number of viable networks to the total number of possible networks for a given n. If so, it would appear that none of the viable gene networks is robust to mutation because the ratio is very small.
The latter inference is misleading because most of the possible gene networks have no similarity in topology with the viable gene networks having a specific saturated equilibrium state, and there is rarely a chance that a viable gene network will suddenly change to one of them. In normal circumstances, the structure perturbations due to mutation are small and the topology can only change gradually. A viable network may experience topology changes step-by-step, and in each step, only one w_{ i j } changes. Ciliberti et al. [13] defined mutational robustness for a viable gene network as the fraction of its one-mutant neighbors that are also viable, and we follow the same criterion. In the following discussion, w_{ i j } is restricted only to take the discrete values [−1,0,1].
We then define robustness to mutation as follows:
Here the subscripts m and i in ${R}_{{m}_{i}}$ denote ‘mutation’ and the i th saturated equilibrium expression state; ${N}_{W}^{v}$ and ${N}_{{w}_{i}}^{v}$ respectively are the total numbers of viable single w_{ i j } changes (which is also the number of one-mutant viable neighbors) of W and its i th row; N_{ W } is the total number of possible single w_{ i j } changes of W. For a given W with a specified saturated equilibrium state, its robustness to mutation can be readily calculated from ${N}_{{w}_{i}}^{v}$ of each row. In Example 5, since each row of a viable network must be one of the seven rows in W^{+} or W^{−}, and ${N}_{{w}_{1}^{+}}^{v}={N}_{{w}_{1}^{-}}^{v}=6$, ${N}_{{w}_{k}^{+}}^{v}={N}_{{w}_{k}^{-}}^{v}=1(k=2,\dots ,7)$, the value of ${R}_{{m}_{i}}$ depends on how many w 1+ and w 1− are contained in W.
The robustness to mutation ${\mathit{R}}_{{\mathit{m}}_{\mathit{i}}}$ of W _{ k }
W _{ k } | ${\mathit{N}}_{\mathit{W}}^{\mathit{v}}$ | ${\mathit{R}}_{{\mathit{m}}_{\mathit{i}}}$ | W _{ k } | ${\mathit{N}}_{\mathit{W}}^{\mathit{v}}$ | ${\mathit{R}}_{{\mathit{m}}_{\mathit{i}}}$ |
---|---|---|---|---|---|
W _{1} | 0 | 0 | W _{5} | 21 | 21/72 |
W _{2} | 6 | 6/72 | W _{6} | 26 | 26/72 |
W _{3} | 11 | 11/72 | W _{7} | 31 | 31/72 |
W _{4} | 16 | 16/72 | W _{8} | 36 | 36/72 |
W_{1} is inviable because its first row does not belong to any row of W^{+} or W^{−}. The other W_{ k } s are viable. As mentioned above, the value of ${R}_{{m}_{i}}$ of a viable W depends on how many w 1+ and w 1− are contained in the network. From W_{2} to W_{8}, more and more w 1+ and w 1− are included. Thus, the corresponding ${N}_{W}^{v}$ and ${R}_{{m}_{i}}$ become larger and larger, and W_{8} is the most stable one with ${N}_{W}^{v}=36$ (i.e., having 36 one-mutant viable neighbors) and ${R}_{{m}_{i}}=1/2$.
Following [13], we also define robustness to mutation, ${R}_{{m}_{\mathit{\text{ij}}}}$, as follows:
Here, the subscript m and i,j in ${R}_{{m}_{\mathit{\text{ij}}}}$ respectively denote ‘mutation’ and the i th saturated equilibrium expression state S_{ i } and j th saturated initial expression state S_{ j }(0); ${N}_{{m}_{\mathit{\text{ij}}}}^{v}$ is the total numbers of viable single w_{ i j } changes of W with respect to S_{ i } and S_{ j }(0), which can be obtained by testing how many viable networks in ${N}_{W}^{v}$ share S_{ j }(0).
Example 8. Determination of ${R}_{{m}_{\mathit{\text{ij}}}}$ for W_{2} and W_{3} in Example 7.
The distribution of ${\mathit{R}}_{{\mathit{m}}_{\mathit{ij}}}$ for the network W _{ 3 }
Final state | ${\mathit{R}}_{{\mathit{m}}_{\mathit{ij}}}$ | ||||
---|---|---|---|---|---|
5/72 | 6/72 | 7/72 | 9/72 | 11/72 | |
S _{ 1 } | 7 | 0 | 3 | 0 | 22 |
S _{ 2 } | 1 | 6 | 0 | 3 | 22 |
4.3 Topology evolution of gene networks
In going from W_{2} to W_{1}, the gene network no longer attains the saturated equilibrium expression state. Thus, we may consider W_{1} as ‘dead’. In going from W_{2} to W_{3}, however, not only is the saturated equilibrium expression state retained, but also the robustness to mutation becomes higher. Suppose that all possible single w_{ i j } changes have the same probability, then the gene network with higher ${R}_{{m}_{i}}$ has a greater chance to ‘survive’. This implies that highly robust topologies can evolve from topologies with low robustness through gradual Darwinian topological changes or ‘natural selection’.
Ciliberti et al. [13] suggested that all viable networks attaining a given gene expression state can be organized into a graph whose nodes are networks that differ in their topology. Two networks (nodes) in the graph are connected by an edge if they differ in the value of only one regulatory interaction (w_{ i j }). As proved above, for a viable network W_{ i }, we can always find one or more viable networks, W_{ j }s differing by only one w_{ i j } from it. The number of viable neighbors differing by a single w_{ i j } for a viable network W is simply the value of its ${N}_{W}^{v}$ (see Table 2). Therefore, any two viable networks W_{ i } and W_{ j } with k different elements w_{ i j } can be connected by a path with k edges and k−1 viable networks between them. For example, W_{2} and W_{8} have six different diagonal w_{ i j }s, and they are connected by a path with six edges and five viable networks between them. This circumstance implies that all viable networks can be organized to comprise a large graph which can be easily traversed by a sequence of single w_{ i j } changes of network topology. Thus, robustness is an evolvable property. To draw this conclusion, a previous study performed millions of simulations [13], but the analytical treatment here directly leads to this result.
5 Modified sigmoidal function with threshold parameters
In this case, (51) may not be satisfied, and −S may not be a saturated equilibrium state.
If θ_{ i }<0 for all i∈J^{+}(S) and θ_{ i }>0 for all i∈J^{−}(S), then both (50) and (51) may not be satisfied for −S, and −S may not be a saturated equilibrium state for W. In Example 2, when the model (48) is used with θ_{ i }=−2(i=1,2,3) and θ_{ i }=2(i=4,5,6), then the network given in (17) only has a single saturated equilibrium state (1 1 1 −1 −1 −1), and all saturated initial states converge to it. Thus, the problem reduces to choosing the parameter θ_{ i } and giving it biological interpretation. Then, we have
i.e., S is a saturated fixed point with 0.99 (β=5) or 0.9999 (β=10) confidence level. A saturated state S satisfying (58,59, or 60) is a fixed point attractor. The proof is given in Theorem A5 in the Appendix. □
Similarly, we can have
under the condition w_{ i j }∈[−a,a], and θ_{ i }∈[−b,b].
Proof. The proof is the same as that for Theorem 6. □
6 Application to a yeast cell-cycle network
Fixed point attractors of the cell-cycle network and the number of saturated states (basin size) converging to them
Basin | Cln3 | MBF | SBF | Cln1,2 | Cdh1 | Swi5 | Cdc20 | Clb5,6 | Sic1 | Clb1,2 | Mcm1 |
---|---|---|---|---|---|---|---|---|---|---|---|
size | |||||||||||
1,764 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | 0 |
151 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
109 | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | 0 |
9 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 |
7 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 |
7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 |
Note that dynamics (64) is different from dynamics (3). For $\sum _{j}{a}_{\mathit{\text{ij}}}{S}_{j}\left(t\right)=0$, dynamics (3) gives S_{ i }(t+1)=0, not S_{ i }(t). Dynamics (3) with the W constructed directly from the connectivities in Figure 7b will not give the same result as that given by dynamics (64). All the information given by the simplified model for yeast cell-cycle network (Figure 7b) will be considered as ‘available experimental information’ for budding yeast, and used as an example to illustrate our analytical treatment for network construction and its robustness analysis. Hereafter, 0 representing the inactive state by Li et al. will be replaced by −1. Only the main results are presented here; see the online Supplementary information (Additional file 1) for more details.
6.1 Construction of viable networks
As mentioned by Li et al. [24], ‘the overall dynamic properties of the network are not very sensitive to the choice of these parameters’ (w_{ i j }), but the connectivity patterns of the network, i.e., the regulatory influence between genes (activation, repression, and absence) is important for determining gene network robustness. Therefore, we restrict w_{ i j } to only take the discrete values 1 (activation), −1 (repression), and 0 (absence).
There are many choices of practically relevant networks.
Condition (79,80) can be used to construct all viable networks W for dynamics (76) with a given set of θ_{ i }s sharing the saturated equilibrium state S_{1}. Since there is no unambiguous biological interpretation for the values of θ_{ i }, as [−1,0,1], to represent activation, repression, absence for w_{ i j }, we will not construct all such viable networks here.
6.2 Construction of yeast cell-cycle networks
W_{0} does not satisfy condition (70,71) for any saturated state and does not have a saturated equilibrium state for dynamics (69). However, W_{0} will be used as the basis for the connectivities of the network to construct networks for dynamics (69) with the saturated equilibrium expression state S_{1}. The construction of networks reduces to satisfying condition ( 74, 75) or ( 79, 80) as much as possible consistent with experimental observation.
have been obtained (the detailed procedure for their construction can be found in the Additional file 1: Supplementary information). W_{1} and W_{2} differ only for w_{8,10}.
6.3 Saturated equilibrium expression states for constructed networks
- 1.
W _{1}
- 2.
W _{2}
with
- 3.
W _{0} with Θ given in (85)
6.4 Robustness to noise
The number of saturated initial states converging to different equilibrium states for different gene networks
Final state | Number of saturated initial states | ||
---|---|---|---|
W_{ 0 }withΘ | W _{ 1 } | W _{ 2 } | |
S _{ 1 } | 2,048 | 1,024 | 979 |
S _{ 2 } | 1,024 | 979 | |
S _{ 3 } | 45 | ||
S _{ 4 } | 45 |
The Robustness to noise ${\mathit{R}}_{{\mathit{n}}_{\mathit{t}}}$ and ${\mathit{R}}_{{\mathit{n}}_{\mathit{i}}}$ for different gene networks
Network | ${\mathit{R}}_{{\mathit{n}}_{\mathit{t}}}$ | ${\mathit{R}}_{{\mathit{n}}_{\mathit{i}}}$ | |||
---|---|---|---|---|---|
S_{ 1 } | S_{ 2 } | S_{ 3 } | S_{ 4 } | ||
W_{0} with Θ | 1 | 1 | |||
W _{1} | 1/2 | 1/2 | 1/2 | ||
W _{2} | 1/4 | 0.478 | 0.478 | 0.022 | 0.022 |
The distribution of ${\mathit{R}}_{{\mathit{n}}_{\mathit{ij}}}$ for the networks W _{ 0 } with Θ and W _{ 1 }
Final state | ${\mathit{R}}_{{\mathit{n}}_{\mathit{ij}}}$(forW_{ 0 }withΘ) | ${\mathit{R}}_{{\mathit{n}}_{\mathit{ij}}}$(forW_{ 1 }) |
---|---|---|
11/11 | 10/11 | |
S _{ 1 } | 2,048 | 1,024 |
S _{ 2 } | 1,024 |
The distribution of ${\mathit{R}}_{{\mathit{n}}_{\mathit{ij}}}$ for the network W _{ 2 }
Final state | ${\mathit{R}}_{{\mathit{n}}_{\mathit{ij}}}$ | ||||||||
---|---|---|---|---|---|---|---|---|---|
2/11 | 3/11 | 4/11 | 5/11 | 6/11 | 7/11 | 8/11 | 9/11 | 10/11 | |
S _{ 1 } | 1 | 12 | 29 | 146 | 791 | ||||
S _{ 2 } | 1 | 12 | 29 | 146 | 791 | ||||
S _{ 3 } | 1 | 6 | 21 | 7 | 3 | 7 | |||
S _{ 4 } | 1 | 6 | 21 | 7 | 3 | 7 |
The results show that W_{0} with Θ is completely stable for any viable pair; for W_{1}, there is one neighbour of S_{ j }(0) differing at the first element, which causes changes in the saturated equilibrium state S_{ i }; for W_{2}, the distribution of ${R}_{{n}_{\mathit{\text{ij}}}}$ is divergent, and S_{1} and S_{2} are much more stable than S_{3} and S_{4}.
6.5 Robustness to mutation
The robustness to mutation ${\mathit{R}}_{{\mathit{m}}_{\mathit{i}}}$ of W _{ 0 } with Θ , W _{ 1 } , and W _{ 2 }
Final state | W_{ 0 }withΘ | W _{ 1 } | W _{ 2 } | |||
---|---|---|---|---|---|---|
${\mathit{N}}_{\mathit{W}}^{\mathit{v}}$ | ${\mathit{R}}_{{\mathit{m}}_{\mathit{i}}}$ | ${\mathit{N}}_{\mathit{W}}^{\mathit{v}}$ | ${\mathit{R}}_{{\mathit{m}}_{\mathit{i}}}$ | ${\mathit{N}}_{\mathit{W}}^{\mathit{v}}$ | ${\mathit{R}}_{{\mathit{m}}_{\mathit{si}}}$ | |
S _{ 1 } | 137 | 0.57 | 140 | 0.58 | 143 | 0.59 |
S _{ 2 } | 140 | 0.58 | 143 | 0.59 | ||
S _{ 3 } | 131 | 0.54 | ||||
S _{ 4 } | 131 | 0.54 |
7 Conclusion
Based on the determination of saturated fixed point attractors for the sigmoidal function model in (3) with a given gene network, W, one can analytically determine which and how many saturated equilibrium expression states exist. Furthermore, for each saturated equilibrium expression state of a W, which and how many saturated initial expression states converging to it can also be determined. These results make it possible to establish the robustness of a given gene network to noise without performing a large number of simulations. Based on the necessary and sufficient condition for gene networks to share the same saturated equilibrium expression state, one can determine all the viable gene networks for a specified saturated equilibrium state. This result also makes it possible to establish the robustness to mutation for a network with a specified saturated equilibrium expression state or a specified pair of saturated equilibrium and initial expression states.
The analytical treatment presented here proved that for a given saturated state, all viable gene networks having it as an equilibrium state must follow certain patterns, i.e., the rows of the corresponding W must be chosen from a finite number of permitted rows. The permitted rows for a given saturated equilibrium state have specific biological meaning and reflect the required connectivity patterns of each gene to other genes. This restriction distinguishes the viable networks for a given saturated equilibrium state from other viable networks with distinct saturated equilibrium states as well as inviable networks. The analysis also proved, without performing a very large numbers of simulations, that all viable networks can be organized as a large graph which can be easily traversed by a sequence of single w_{ i j } changes of network topology. Thus, robustness is an evolvable property. Highly robust topologies can evolve from topologies with low robustness through gradual Darwinian topological changes or natural selection. The analytical treatment presented in this paper may be employed not only for robustness analysis but also for the model construction and analysis of other properties for gene networks.
Appendix
The appendix proves several theorems in the main text.
Lemma 1(Banach Fixed Point Theorem[25])
Let (X,d) be a non-empty complete metric space with a contraction mapping g:X→X. Then g admits a unique fixed point x^{∗} in X (i.e. g(x^{∗})=x^{∗}). Furthermore, x^{∗} can be found as follows: start with an arbitrary element x_{0}∈X and define a sequence {x_{ n }} by x_{ n }=g(x_{n−1}), then x_{ n }→x^{∗}.
where d denotes the distance, for all x, y∈X.
This condition is sufficient, but not necessary. It is possible that one of (95 to 97) is satisfied, but the other two may not. Such examples can be constructed.
Proof. Since 0 is a fixed point for any W, to see whether it is a unique fixed point attractor, we need to determine under what condition (3) is a contraction mapping.
The condition in (111) is sufficient, but not necessary. □
Theorem A2. A saturated state S satisfying (13) and (14) is a fixed point attractor.
Proof. Suppose S is a fixed point of (3). Let ${B}_{\epsilon}=\{\widehat{\mathbf{S}}\in {\Re}^{n}\mid \parallel \mathbf{\text{X}}=\widehat{\mathbf{S}}-\mathbf{\text{S}}\parallel <\epsilon \}$, where ε is chosen sufficiently small such that there is only a single fixed point S within B_{ ε }. X=0 is a fixed point in B_{ ε } with representation X because S is a fixed point for $\widehat{\mathbf{S}}$ in B_{ ε }. If we can prove that X=0 is a fixed point attractor in B_{ ε }, then so is S.
where β^{′}≥β.