A hybrid multiscale Monte Carlo algorithm (HyMSMC) to cope with disparity in time scales and species populations in intracellular networks

Samant, Asawari; Ogunnaike, Babatunde A; Vlachos, Dionisios G

doi:10.1186/1471-2105-8-175

Methodology article
Open access
Published: 24 May 2007

A hybrid multiscale Monte Carlo algorithm (HyMSMC) to cope with disparity in time scales and species populations in intracellular networks

Asawari Samant¹,
Babatunde A Ogunnaike¹ &
Dionisios G Vlachos¹

BMC Bioinformatics volume 8, Article number: 175 (2007) Cite this article

5162 Accesses
19 Citations
Metrics details

Abstract

Background

The fundamental role that intrinsic stochasticity plays in cellular functions has been shown via numerous computational and experimental studies. In the face of such evidence, it is important that intracellular networks are simulated with stochastic algorithms that can capture molecular fluctuations. However, separation of time scales and disparity in species population, two common features of intracellular networks, make stochastic simulation of such networks computationally prohibitive. While recent work has addressed each of these challenges separately, a generic algorithm that can simultaneously tackle disparity in time scales and population scales in stochastic systems is currently lacking. In this paper, we propose the hybrid, multiscale Monte Carlo (HyMSMC) method that fills in this void.

Results

The proposed HyMSMC method blends stochastic singular perturbation concepts, to deal with potential stiffness, with a hybrid of exact and coarse-grained stochastic algorithms, to cope with separation in population sizes. In addition, we introduce the computational singular perturbation (CSP) method as a means of systematically partitioning fast and slow networks and computing relaxation times for convergence. We also propose a new criteria of convergence of fast networks to stochastic low-dimensional manifolds, which further accelerates the algorithm.

Conclusion

We use several prototype and biological examples, including a gene expression model displaying bistability, to demonstrate the efficiency, accuracy and applicability of the HyMSMC method. Bistable models serve as stringent tests for the success of multiscale MC methods and illustrate limitations of some literature methods.

Background

Stochastic behavior, resulting from the small population of species in an intracellular medium, can stimulate interesting molecular phenomena, such as noise-induced bifurcations, phenotypic heterogeneity, etc. Through such phenomena, intracellular noise can have an impact on the physiology of the cell [1, 2]. It is therefore important that in silico analysis of intracellular networks, aimed at relating cell function to molecular events, is capable of capturing fluctuations. Exact stochastic algorithms, such as the stochastic simulation algorithm (SSA) [3, 4] and its variants [5], can capture the molecular noise by accounting for the random nature of biochemical processes.

The SSA [3, 4] provides the exact solution but can be computationally intensive, especially when large populations and/or reaction networks are involved. τ-leap methods [6–9], which accelerate the simulation by firing multiple reactions in a coarse time step, provide an excellent alternative when large populations are involved. However, the original Poisson-based τ-leap method [6] and its modifications, the binomial τ-leap [7, 8] and the implicit τ-leap [10] methods, do not perform too well at low populations. Therefore, an intracellular network comprised of mixed population scales is not amenable to a τ-leap treatment. A modified τ-leap algorithm [11] was recently developed to avoid negative populations that can occur in the Poisson τ-leap method. Implicitly, this modification results in a hybrid, stochastic algorithm that handles mixed population levels by seamlessly switching between the SSA for low population species and the Poisson τ-leap method for large population species [11]. Similarly, the partitioned leaping method [12] combines the exact next reaction method [5] with the Poisson τ-leap [6] method to enable stochastic simulations over a disparate range of populations.

Another aspect of biological networks plaguing stochastic simulation is the presence of processes with disparate reaction rates. Fast reactions, which are sampled more frequently by the SSA, reduce the size of the time step. As a result, it is difficult to simulate macroscopic real-time. τ-leap methods are also inefficient when a large separation of time scales is encountered. Most multiscale, stochastic algorithms [13–21] accelerate simulation of stiff networks by reducing the time spent in simulating the fast network. One way to achieve this is to use an approximate, accelerated algorithm, such as the Langevin method [22], to speed up the simulation of fast reactions [15, 19]. This approach tacitly assumes that the faster kinetics arise purely from the large populations, and is not applicable to cases involving fast reactions with small populations. An alternative multiscale approach [13, 16–18, 20] based on the slow-scale SSA (SS-SSA) [13, 14] concept, uses information from the quasi-equilibrium (QE) description of the fast network to evolve the slow network (see Appendix A). In this multiscale approach, individual networks are modeled with the SSA, and thus, large populations in either the fast or the slow network cannot be handled effectively.

A generic stochastic algorithm, which simultaneously addresses the disparity in time scales and species populations in well-mixed reaction networks, is currently lacking (see Figure 1). In this paper, we propose a hybrid, multiscale Monte Carlo (abbreviated as HyMSMC) algorithm to fill in this gap (a flow chart of the steps involved is presented in (Figure 2). Like our original multiscale Monte Carlo (MSMC) algorithm [17], the proposed HyMSMC algorithm makes no a priori assumptions about the scales and collapses to the exact SSA and/or non-multiscale stochastic solver, when a scale separation is absent. Additional new elements in our work include: (1) the introduction of the computational singular perturbation (CSP) framework, as an auxiliary tool, to aid network partitioning and determine relaxation times of the fast network for its convergence and (2) of a new statistical relaxation criterion that eliminates the need for the complete description of the QE probability distribution function (PDF), and (3) the applicability of the HyMSMC method to networks displaying bistability in the fast or the slow networks. To the best of our knowledge, this is the first successful application of a multiscale, stochastic algorithm to a bistable network, reported in the literature.

The paper is organized as follows. We begin with a brief introduction of the SSA notation, and then provide a detailed explanation of the CSP-assisted partitioning technique and the new statistical relaxation criterion. Next, we introduce the new hybrid multiscale Monte Carlo (HyMSMC) algorithm. Finally, we demonstrate the efficiency, accuracy and generality of the new algorithm with a few prototype and real biological examples.

Results and Discussion

Stochastic simulation algorithm (SSA) Notation

The notation we follow is identical to that of Gillespie [3, 4]. Consider a well-mixed, isothermal system of n species S≡ {S₁, S₂, ..., S_n} reacting through m reactions R≡ {R₁, R₂, ..., R_m}. Let the state of the system at any time t be denoted by a n-dimensional vector, X(t) = (X₁(t), X₂(t), ..., X_n(t)), where X_i(t) is the number of molecules of species S_i at time t. Let the n-dimensional vector υ_jcorrespond to the stoichiometry vector of reaction R_j, such that υ_ij is the stoichiometric coefficient of species S_i in reaction R_j. Given that the system is in a state X(t) = x at time t, we define a propensity function, a_j(x, t) such that a_j(x,t)dt gives the transition probability of the j^th reaction, Rj, occurring in an infinitesimally small time interval (t, t+dt). This function is typically dependent on the state of the species and the reaction conditions of the network, such as temperature and pressure.

Stochastic Quasi-Equilibrium, Network Partitioning, Relaxation, and Evolution

a. Quasi-equilibrium (QE) in stochastic systems

In dealing with numerical "stiffness" in stochastic systems, arising from separation of time scales, one needs to account for the probabilistic nature of the QE [23], which is given by a distribution of states, rather than a single state. Accordingly, every state of the slow variables determines a QE PDF of the fast network, i.e., a stochastic low-dimensional manifold. Stochastic QE is defined by a time-invariant QE PDF, i.e., the existence of a stable equilibrium of the fast network [13]

\lim_{t^{'} \to \infty} \hat{P} (x^{f}, t^{'} | x, t) = P_{\infty} (x^{f} | x),

(1)

where $\hat{P}$ (x^f, t'|x, t) is the probability of observing ${\hat{X}}^{f}$ (t') = x^f, given that the system is in a state x at time t. ${\hat{X}}^{f}$ (t) is a stochastic process that is identical to X^f(t), but describes the evolution of the fast species purely via the fast network.

The fast species relax to the stationary PDF, P_∞(x^f|x), at t'→∞ (asymptotic limit); in practice, this happens over a relaxation time, $τ_{R e l}^{f}$ , of the fast network. Accuracy of the QE approximation requires that the relaxation time, $τ_{R e l}^{f}$ , be much smaller than the smallest time scale, $\min_{j \in R^{s}} (τ_{j}^{s})$ , of the slow network

\lim_{t^{'} \to τ_{R e l}^{f}} \hat{P} (x^{f}, t^{'} | x, t) \approx P_{\infty} (x^{f} | x) .

(2)

Provided that Eq. (2) is satisfied, Eq. (1) transforms into

\lim_{t^{'} \to τ_{R e l}^{f}} \hat{P} (x^{f}, t^{'} | x, t) \approx P_{\infty} (x^{f} | x) .

(3)

Obtaining this probabilistic description and then using it in the evolution of the slow network forms the core of multiscale, stochastic MC methods [13, 16, 17, 20] (see also Appendix A). The solver used to compute it is called microscopic solver. A difficulty is that the relaxation time, $τ_{R e l}^{f}$ , is unknown in complex systems. We have indirectly addressed this issue in Refs. [17, 24] and revisit it below.

b. Partitioning of a stiff stochastic system

In our algorithm, one method we use follows the partitioning scheme proposed by Cao et al. [13] by identifying the fast and slow processes based on reaction propensities, a_j(x, t). We identify a fast reaction subset $R^{f} \equiv {R_{1}^{f}, R_{2}^{f}, ..., R_{m^{f}}^{f}}$ with m^f reactions, and a slow reaction subset $R^{s} \equiv {R_{1}^{s}, R_{2}^{s}, ..., R_{m^{s}}^{s}}$ with m^s reactions. All species (reactants and products) participating in fast reactions are defined as fast species, and the remaining species as slow species. The state is given by X(t) = (X^f(t),X^s(t)), where $X^{f} (t) = (X_{1}^{f}, X_{2}^{f}, .., X_{n^{f}}^{f})$ and $X^{s} (t) = (X_{1}^{s}, X_{2}^{s}, .., X_{n^{s}}^{s})$ are the states of the fast and slow species, respectively. n^f and n^s are the number of fast and slow species, respectively, and n = n^f+n^s is the total number of species. The propensity function of the slow reaction $R_{j}^{s}$ and of the fast reaction $R_{j}^{f}$ at state X(t) = (x^f,x^s) can be generally written as $a_{j}^{s} (x; t) = a_{j}^{s} (x^{f}, x^{s}; t)$ , and $a_{j}^{f} (x; t) = a_{j}^{f} (x^{f}; t)$ , respectively.

Partitioning is done according to a user-defined threshold of the propensities. The choice of this threshold is influenced by the required accuracy and the computational cost. Using simple model systems [17], it was found that a separation of time scales of at least 2 orders of magnitude is necessary for the MSMC method to be accurate and computationally more efficient; this value is further discussed below using new and more complex examples. This ranked propensity-based partitioning scheme does not always yield a single cut-off point, especially for large networks with multiple gaps in time scales. Furthermore, it does not identify the relaxation time. We propose the use of the computational singular perturbation (CSP) technique to assist with these difficulties.

Computational Singular Perturbation (CSP)-assisted stochastic partitioning

The computational singular perturbation (CSP) technique was introduced by Lam, Goussis and co-workers, e.g., [25, 26], as a diagnostic tool for gaining physical insight into the reaction dynamics of large complex networks, and as an accelerated deterministic solver for stiff networks. Our principal interest in the CSP approach is in its diagnostic power, specifically its ability to identify the species present in QE, and the dominating reactions in the fast and slow modes. We refer to our use of CSP techniques in network partitioning as a "CSP-assisted stochastic partitioning".

Consider the deterministic representation of the dynamics of a reaction network

g (x) = \frac{d X}{d t} = \sum_{i = 1}^{m} υ_{i} a_{i} (x) .

(4)

g(x) is an n-dimensional vector, such that g_i(t) = dX_i/dt describes the rate of change of species S_i. Differentiating (4), yields

\frac{d g (x)}{d t} = \frac{\partial g}{\partial X} \cdot \frac{d X}{d t} = \underline{\underline{J}} \cdot g,

(5)

where $\underline{\underline{J}} = \frac{\partial g}{\partial X}$ is the Jacobian matrix of g. In general, the jacobian matrix, $\underline{\underline{J}}$ is not a diagonal matrix. We perform a linear transformation on g,

f = \underline{\underline{b}} \cdot g,

(6)

such that the transformed mode f evolves in a decoupled manner,

\frac{d f}{d t} = \underline{\underline{Λ}} \cdot f .

(7)

The matrix $\underline{\underline{b}}$ consists of a set of n linearly independent row vectors, b_i, and is chosen appropriately to decouple the evolution of the transformed modes, f. $\underline{\underline{Λ}}$ is a diagonal matrix with the diagonal elements, |Λ_ii|, arranged in the order of descending magnitudes. Every vector, b_i, is associated with a vector, $b_{i}^{*}$ through the biorthonormality condition, such that,

b_{i}^{*} \cdot b_{j} = 1 (i = j) a n d b_{i}^{*} \cdot b_{j} = 0 (i \neq j)

(8)

The matrix $\underline{\underline{b^{*}}}$ consists of n linearly independent column vectors, $b_{i}^{*}$ , and is called the basis vectors set. The matrix $\underline{\underline{b}}$ , which is the inverse of the basis vector set, is called the dual vector set. Lam defined an ideal basis vector set, $\underline{\underline{b^{*}}}$ as one that completely decouples the evolution of the transformed modes [25, 26]. The CSP method provides an iterative refinement procedure [25, 27] to obtain the ideal basis vector set from a random trial basis set.

For linear systems, the Jacobian matrix is independent of time, and thus, the right hand eigenvectors of the Jacobian, $\underline{\underline{J}}$ , can be conveniently used as the ideal basis vector, $\underline{\underline{b^{*}}}$ (assuming $\underline{\underline{J}}$ is a perfect matrix of linearly independent eigenvectors). In this case, $\underline{\underline{b}}$ is the inverse of the eigenvector matrix of $\underline{\underline{J}}$ . For nonlinear systems, the eigenvectors of $\underline{\underline{J}}$ are time-dependent, and do not constitute an ideal basis vector set. Lam [25] proved that using the eigenvectors as an ideal basis set for nonlinear systems provides a leading order accuracy. Since we employ the CSP technique as a partitioning guide, leading order accuracy suffices as our examples indicate.

The elements Λ_ii represent the time scales of the system, with a possible gap in the magnitude of Λ_ii reflecting a time-scale separation. Assuming that such a gap can be identified, i.e., the system is stiff, we get a set of n^f fast modes, f^f, and n^s slow modes, f^s. If the separation is large enough, i.e., $\min (| Λ_{i i}^{f} |) ≫ \max (| Λ_{i i}^{s} |)$ , then the fast modes relax rapidly, and the reduced system consisting of only the slow modes can be evolved over the low-dimensional manifold using larger time increments. While such a transformation in deterministic systems is helpful in removing stiffness from the system and reducing the system dimension, here we are more interested in extracting the physical meaning of the modes, and identifying reactions and/or species that contribute to the fast modes.

Lam [25] defined the mode participation index (PI) of reaction j in mode i as

P_{j}^{i} \equiv \frac{B_{j}^{i} a_{j}}{\sum_{r = 1}^{m} | B_{r}^{i} a_{r} | + O (| b_{i} \cdot Δ y^{*} / Δ t^{*} |)} i = 1, 2, ... n, j = 1, 2, ..., m

(9)

where $B_{j}^{i}$ is given by the dot product b_i ⊙ υ_j. Δy* and Δt* are user-defined values of acceptable error in temporal solution provided by the CSP method. To simplify our analysis, we neglect the second term, O(|b_i·Δy*/Δt*|), that sets the threshold for magnitude of relaxed f. The participation index, $P_{j}^{i}$ , for the fast modes (i = 1, 2, ..., n^f) gives the fractional contribution of reaction j to the fast mode, i.

Herein, the CSP method is used at any stochastic state to understand the instantaneous time scales and appropriately partition the network at that state. This constitutes our second partitioning method. The evolution of the reaction network is still carried out in a stochastic manner, using the HyMSMC method. The Jacobian at a stochastic state can be approximated using the corresponding deterministic model or its variant where the mean-field propensity functions are used instead of the deterministic reaction rates. We propose the latter approach. In addition, we recommend using an analytical Jacobian, if possible, due to its higher accuracy and lower computational cost. Aside from partitioning, the CSP method provides an estimate of the relaxation time of the system that could be used for converging the fast network. Since there are multiple eigenvalues, and thus multiple time scales, in our implementation we choose the smallest eigenvalue (in magnitude) from the n^f fast modes, to relax the fastest dynamics. We discuss this concept further below and in the results section.

Obviously, there is a computational overhead associated with the evaluation of the Jacobian and the eigenvalue analysis. This overhead depends on whether the system is linear (eigenvalue analysis is done only once) or not, the system size and the frequency of carrying out the CSP analysis. Ideally, network partitioning, either CSP-assisted or ranked propensity-based, should be performed prior to every call to the microscopic solver. Practically however, when the trajectory evolves fairly smoothly and gradually, a few calls of the CSP routine during a simulation are sufficient. For the networks considered in this work, the overhead of CSP is negligible in comparison to the cost of a stochastic simulation (see results section). Below, we demonstrate the potential of the CSP technique using several examples and discuss the CPU of the CSP analysis for the most complicated example.

c. Evolution of the slow network

Once the QE PDF of the fast network has been obtained, the slow network is evolved. Appendix A discusses various approximations. The slow-scale approximation [13] evolves the slow network using the slow-scale propensities as the effective transition probabilities of the slow reactions. The slow-scale propensity of reaction $R_{j}^{s}$ is the expectation of the propensity function, $a_{j}^{s}$ (x^f, x^s), over the temporal QE PDF given by

\bar{a_{j}^{s}} (x) = \sum_{x^{f^{'}}} {\hat{P}}_{\infty} (x^{f^{'}} | x) \cdot a_{j}^{s} (x^{f^{'}}, x^{s}),

(10)

where the summation is performed over all the equilibrium states x^f'visited by the fast species [13]. Slow reactions are picked and the elapsed time is computed the same way as in the SSA.

After a slow reaction has been picked, the populations of the species need to be updated. This is not as straightforward, and while critical, has been largely overlooked. A rigorous way of picking slow reactions and subsequently updating the slow network is based on a joint PDF [17] (see Appendix A). In this work, we use the last state present after the relaxation criterion (see section d) has been met. In doing this, we minimize the computational effort and also ensure that the fast state belongs to stationary PDF P_∞(x^f| x). In some cases, the selected slow reaction cannot be executed from some statesof the fast species, e.g., a reactant species with zero population, even though these states might be visited frequently [17]. Care is taken that the chosen slow reaction can be executed from that state by checking the feasibility of firing the chosen slow reaction from the selected state. Failure to satisfy this condition calls for additional simulation of the relaxed fast network until the chosen slow reaction can occur from the current state of fast species or selection from a database of states from P_∞(x^f| x). This eliminates the problem of having negative populations.

d. Relaxation of the Fast Network

The next question is how does one gauge the relaxation of the fast network and decide the time, τ_Eqm, required to obtain an accurate approximation of the stationary PDF, P_→(x^f| x), or the slow-scale propensities, $\bar{a_{j}^{s}}$ (x) ? E et al. [20] decide τ_Eqm via an implicit relation to the deviation of the numerical estimate of the slow-scale propensity from its true value. Salis and Kaznessis [16] relax the fast network for a user-defined number of MC events, $M C E_{E q m}^{f}$ . A predetermined choice of τ_Eqm or $M C E_{E q m}^{f}$ overlooks the likelihood of the relaxation time varying with time, and may make the multiscale simulation inaccurate and potentially inefficient. Thus, we need a more generic relaxation criterion that decides the relaxation time (or number of MC events) of the fast network on-the-fly. This issue is addressed next.

The slow-scale propensity is the first moment (or mean) of the propensity of the slow reaction, evaluated over the entire stationary PDF P_∞(x^f| x). In this work, we test convergence of the slow-scale propensities rather than of the entire PDF. We propose the 2 sample t-test to check for convergence of the slow-scale propensities. The 2 sample t-test is a commonly used statistical tool to verify with a certain degree of confidence, if two independently drawn samples of a random variable come from the same probability distribution. In the current context, the state of the fast species (and also the propensity, $a_{j}^{s}$ (x^f,x^s), of the slow reaction) is the random variable, such that a sample of a size p corresponds to p events of the fast network, with the state at every fast event corresponding to one data point in the sample. The t-statistic for a 2 sample t-test, assuming unequal sample sizes, N₁ and N₂, and unequal variances, is given by

t_{j} = \frac{{\bar{a_{j}^{s}} |}_{1} - {\bar{a_{j}^{s}} |}_{2}}{\sqrt{\frac{{σ_{j}^{2} |}_{1}}{N_{1}} + \frac{{σ_{j}^{2} |}_{2}}{N_{2}}}} \forall j \in R^{s} .

(11)

Here ${\bar{a_{j}^{s}} |}_{1}$ and ${σ_{j}^{2} |}_{1}$ are, respectively, the mean and variance of the propensity $a_{j}^{s}$ evaluated from N₁ = (w - 1)·MCE_win MC events in the first (w-1) simulation windows. ${\bar{a_{j}^{s}} |}_{2}$ and ${σ_{j}^{2} |}_{2}$ are the mean and the variance of the propensity $a_{j}^{s}$ from N₂ = MCE_Win events in the w^th (last) window. We define a window as a short stochastic simulation of the fast network, consisting of MCE_win fast MC events. The slow-scale propensities are converged to a stationary value when

-t_α/2,υ <t_j, <t_α/2,υ, for all j∈R^s,

where t_α/2,υ is the value of the t-statistic for υ degrees of freedom and a significance level of α/2. We use a significance level, α = 0.05, in all the simulations shown in the results section. υ is evaluated as

υ = \frac{{({σ_{j}^{2} |}_{1} / N_{1} + {σ_{j}^{2} |}_{2} / N_{2})}^{2}}{{({σ_{j}^{2} |}_{1} / N_{1})}^{2} / (N_{1} - 1) + {({σ_{j}^{2} |}_{2} / N_{2})}^{2} / (N_{2} - 1)} .

(13)

To simplify the implementation of the above test and to avoid the recurring evaluation of the t_α/2,υ, we assume that υ > 40, and hence fix the critical value at t_α/2,υ ≅ 1.96. Since t_α/2,υ increases with decreasing υ, using t_α/2,υ ≅ 1.96 in cases where υ < 40 imposes a more stringent criterion on the relaxation, and hence can be used safely.

The accuracy of the multiscale method depends on the convergence of the slow-scale propensities. For instance, if the QE PDF is bimodal, then it is crucial that both the branches be sufficiently sampled in evaluating the slow-scale propensities. Also, it is possible that by pure chance, the MCE_win fast events in the first relaxation window do not alter the populations of any fast species participating in the slow network. So the t-test might falsely indicate convergence of slow-scale propensities as a result of inadequate sampling. To avoid such a situation, we suggest that the window length accounts for the size of, and the separation of scales within, the fast network, to ensure that all the fast reactions are adequately sampled.

We propose that the relaxation time, computed via CSP, should be used in conjunction with the relaxation time estimated via the statistical one to ensure sufficient sampling of QE state space. In general, we find good agreement between the statistical and CSP relaxation methods as detailed below in several numerical examples. In general, one should relax the system for the longest of the two times estimated by the CSP and the statistical criterion. Convergence of bistable systems is interesting (and more complex) as discussed in the results section.

The Hybrid Multiscale Monte Carlo (HyMSMC) Method

The new relaxation criterion discussed above enhances the efficiency of the original MSMC method. There are two other areas that can be exploited to further accelerate the method: one is to execute multiple firings at each microscopic time step and the other is to reduce the number of evaluations of the QE PDF. One could reuse the same QE PDF for a few consecutive coarse (macroscopic) time steps, assuming that the occurrence of the slow events does not significantly alter this PDF. This is reminiscent of the τ-leap (TL) methods [6–10], where the leap condition allows one to perform multiple firings of the reactions in a pre-selected leap time. The inherent assumption of the TL methods is that the propensities remain unchanged in the leap time, and hence one can approximate the number of firings in the reaction channels using a Poisson [6, 10] or a binomial random variable [7, 8]. While maintaining the same QE PDF for a few coarse (macroscopic) time steps can potentially result in substantial computational savings, we need a criterion to indicate how long one could maintain the same PDF without sacrificing accuracy. We address this question by using the framework of the hybrid SSA-TL solver [11] to systematically coarse grain the macroscopic solver in time. Since leaping the slow reactions significantly moves the fast network away from its QE, we also propose the hybrid SSA-TL method [11] as the microsolver to speedup the relaxation of the fast network at every coarse time step.

The τ-leap solver [6–8], which temporally coarse-grains the exact SSA, works well only in the limit of large population [7, 8, 11]. At low population of the reactant species, there is a risk of observing negative populations due to the unbounded nature of the Poisson random variable. The binomial τ-leap methods [7, 8] eliminate this problem, but are not as accurate and efficient at low populations. Recently, the hybrid SSA-TL method was introduced by Cao et al. [11], primarily, to reduce the possibility of negative population of the Poisson τ-leap solver. A similar hybrid solver, the partitioned leaping method [12], which combines the next reaction method [5] and the Poisson τ-leap method [6], was introduced by Harris and Clancy. We incorporate the hybrid SSA-TL solver [11] into the MSMC method as the macrosolver and the microsolver. This practically eliminates the likelihood of negative populations while using the explicit Poisson TL method. We call this method the hybrid multiscale Monte Carlo (HyMSMC) method.

The overall concept of HyMSMC is depicted in Figure 1. Integration of the hybrid solver with the multiscale framework begins with classification of the fast (and the slow) reactions into SSA reactions and TL reactions subset, $R_{S S A}^{f}$ and $R_{T L}^{f}$ and ( $R_{S S A}^{s}$ and $R_{T L}^{s}$ ), respectively. All reactions whose reactant(s) population is less than some critical population, X_crit, are defined as SSA reactions [11]. We have successfully used X_crit = 10 in all simulations to obtain accurate results. In principle, the cost and accuracy of the hybrid solver increases with X_crit, with the method reducing to SSA for X_crit → ∞, and to a Poisson TL for X_crit = 0. Generating Poisson random numbers is more expensive than generating uniform random numbers. As a result, there is a X_crit value below which the hybrid solver is more expensive than the SSA. This computational overhead of the hybrid solver should be considered in deciding the optimum X_crit that maximizes the efficiency of the solution.

a. Hybrid Solver as the Microscopic Solver

The objective of using the hybrid solver as the microsolver is to accelerate the relaxation of the fast network via a coarser sampling of the QE state space of the fast species. Only one reaction from the SSA reaction group is executed. The average elapsed time until the occurrence of the next fast reaction belonging to $R_{f}^{S S A}$ is evaluated as [11]

τ_{S S A}^{f} = - \frac{\ln (ξ_{1})}{\sum_{j \in R_{S S A}^{f}} a_{j}^{f}} .

(14)

The hybrid solver executes multiple firings of the TL reactions in a leap. The leap time is estimated using a leap criterion that imposes an upper bound on the mean and the variance of the change in the propensity in this leap time [28]. To evaluate $τ_{T L}^{f}$ , we use the r-criterion (related to the stability of the stochastic model) given by [7]

τ_{T L}^{f} = \min_{i \in S^{f}} {\frac{r x_{i}^{f}}{\sum_{j \in R_{T L}^{f}; υ_{i j}^{f} < 0} - υ_{i j}^{f} a_{j}^{f} (x^{f})}} .

(15)

The time increment τ^f is chosen as

τ^{f} = \min {τ_{T L}^{f}, τ_{S S A}^{f}}

(16)

If $τ_{T L}^{f} \geq τ_{S S A}^{f}$ , we sample a single fast reaction ( $k_{j}^{f}$ = 1) from the subset $R_{S S A}^{f}$ such that j is the smallest integer satisfying

\sum_{i = 1}^{j} a_{i}^{f} \geq ξ_{2} \cdot \sum_{i \in R_{S S A}^{f}} a_{i}^{f} .

(17)

Here ξ₁ and ξ₂ are uniform random numbers, sampled from the unit interval (0,1]. The number of firings, $k_{j}^{f}$ of a TL reaction $R_{j}^{f}$ ∈ $R_{T L}^{f}$ is sampled from a Poisson distribution with mean $a_{j}^{f} τ^{f}$

k_{j}^{f} = P (a_{j}^{f} τ^{f}), f o r a l l j \in R_{T L}^{f} .

(18)

If $τ_{T L}^{f} < τ_{S S A}^{f}$ then only the TL reactions are fired according to Eq. (18).

b. Hybrid Solver as the Macroscopic Solver

Employing the hybrid solver as the macroscopic solver to evolve the slow network replaces the instantaneous value of propensities in the above Eqs. with the slow-scale propensities. For completeness, the time increment used to evolve the slow network is given by

τ^{s} = \min {τ_{T L}^{s}, τ_{S S A}^{s}}

(19)

where

τ_{S S A}^{s} = - \frac{\ln (ξ_{1})}{\sum_{j \in R_{S S A}^{s}} a_{j}^{s}}

(20)

τ_{T L}^{s} = \min_{i \in S} {\frac{r x_{i}}{| \sum_{j \in R_{T L}^{s}; υ_{i j}^{s} < 0} - υ_{i j}^{s} \bar{a_{j}^{s}} |}} .

(21)

If $τ_{S S A}^{s} \leq τ_{T L}^{s}$ , we sample a single reaction ( $k_{j}^{s}$ = 1) from the subset $R_{S S A}^{s}$ , such that j is the smallest integer satisfying

\sum_{i = 1}^{j} \bar{a_{i}^{s}} \geq ξ_{2} \cdot \sum_{i \in R_{S S A}^{s}} \bar{a_{i}^{s}} .

(22)

The number of firings, $k_{j}^{s}$ , of a TL reaction $R_{j}^{s}$ ∈ $R_{T L}^{s}$ is sampled from a Poisson distribution with mean $\bar{a_{j}^{s}}$ τ^s

k_{j}^{s} = P (\bar{a_{j}^{s}} τ^{s}), f o r a l l j \in R_{T L}^{s} .

(23)

If $τ_{S S A}^{s} > τ_{T L}^{s}$ , only the TL reactions are executed according to Eq. (23).

Algorithm Implementation

1.
Initialize the system state X(t = 0) at time t = 0 and the simulation parameters, such as the critical population X_crit, the number of fast MC events, MCE_win, for each relaxation window, the coarse graining factor, r, in the leap criterion, the partitioning criterion, and the significance level a of the statistical test used for the relaxation.
2.
At the state X(t) = (x^f,x^s), evaluate the propensities of all the reactions in the

network. Partition the reaction network into a fast and a slow network, and the species into fast and slow species.

a. Microscopic Solver – Fast Network

3.
Perform MCE_win MC events of the fast network using the hybrid SSA-TL solver. In each event,
a.
Identify the SSA and the TL subsets in the fast network, $R_{S S A}^{f}$ and $R_{T L}^{f}$ , respectively, based on the populations of the participating reactants;
b.
Evaluate the time increment τ^f using (14)-(16);
c.
Execute $k_{j}^{f}$ reactions, $R_{j}^{f}$ ∈ R^fusing (17) and (18);
d.
Update the state of the fast species, $x_{i}^{f} \leftarrow x_{i}^{f} + \sum_{j \in R^{f}} k_{j}^{f} ν_{i j}^{f} f o r i \in S^{f}$
4.
Evaluate the slow-scale propensities $\bar{a_{j}^{s}}$ (see Appendix, Eq. (4)).
5.
If the convergence criterion (12) is satisfied and the time is longer than the relaxation time predicted by CSP, go to 6. Else, go to 3 and perform another MCE_win fast events using the hybrid solver.

b. Macroscopic Solver – Slow Network

6.
Evolve the slow network using the hybrid SSA-TL solver:
a.
Identify the TL and the SSA reactions subsets $R_{T L}^{s}$ and $R_{T L}^{s}$ at the state (x^f, x^s);
b.
Evaluate the time increment τ^s using Eqs. (19)-(21);
c.
Evaluate the number of firing $k_{j}^{s}$ of $R_{j}^{s}$ , using Eqs. (22) and (23);
d.
If any reaction $R_{j}^{s}$ ∈ R^scannot be fired $k_{j}^{s}$ times, execute steps 3a-3d until state X^f= x^fallows $k_{j}^{s}$ executions of all reactions, $R_{j}^{s}$ ∈ R^sor pick a state from a database that can be fired. Else go to 6e;
e.
Update the state of all species, $x_{i} \leftarrow x_{i} + \sum_{i = 1}^{n} k_{j}^{s} ν_{i j}^{s} f o r i \in S$ .
f.
Update the time t = t + τ^s.
7.
If the desired time is reached, stop. Else, go to step 2.

Step 6d is performed to avoid negative populations that may arise from using the stationary PDF to select the representative state of the fast network (see discussion in sub-section c above and Appendix A). Next we demonstrate the strength of the HyMSMC algorithm with the help of simple prototype examples. Real biochemical networks are also considered to demonstrate the generality of the approach to complex networks.

Algorithm Testing

In each of the following examples, we demonstrate the efficiency and accuracy of the proposed HyMSMC algorithm. Simulations were performed on 2.40 GHz Intel^® Xeon(TM) processors. The relaxation of the fast network is assessed with a relaxation tolerance, t_α/2,υ = 1.96 (α = 0.05,υ > 40) and using windows of 25 MC events. The hybrid solver uses a critical population of X_crit = 10 and a leap condition tolerance of r = 0.05. In all the examples shown below, the method is accurate with this parameter set. Each numerical example uniquely serves to highlight other novelties of the new HyMSMC scheme.

a. Coupled Isomerization Reactions

The first reaction network consists of two pairs of reversible isomerization reactions, coupled through a common species B

\begin{array}{l} A ⇄_{k_{- 1}}^{k_{1}} B \\ B ⇄_{k_{- 2}}^{k_{2}} C . \end{array}

(A1)

The above network is one of the simplest networks conducive to QE treatment. The simplicity of this network allows us to clearly demonstrate the key features of the HyMSMC algorithm. It also clarifies the applicability and adaptability of the algorithm to different levels of time-scale and population-scale separation. The rate constants are chosen such that the first reaction pair is much faster than the second reaction pair, and thus it constitutes the fast network. A four orders of magnitude time scale separation exists between the partitioned networks. Based on our definition, species A and B are the fast species and C is the slow species.

Starting with an initial state X_A(t = 0) = 1500, X_B(t = 0) = X_c(t = 0) = 0, we generate stochastic trajectories of network (Al) using the exact SSA, the MSMC method, and the HyMSMC method. As can be seen in Figure 3 and Figure 4, the multiscale methods correctly predict the temporal evolution of the network. The HyMSMC trajectories in Figure 3c and Figure 4c have been generated from states recorded after the occurrence of every slow event, i.e., at every macroscopic step, compared to every millionth MC events in the SSA (Figure 3a and Figure 4a) trajectories. Approximately, 1000 data points have been used to generate all the trajectories seen in Figure 3 and Figure 4. SSA takes around 8 minutes for a single stochastic run from t = 0 to t = 10000 s. The MSMC run is completed in half a minute, and the HyMSMC run, in a fraction of a second. The speedup over the exact SSA is 16 and 320 using the MSMC technique and the HyMSMC technique, respectively. Using number of MC events as a metric, the speedup of the MSMC and HyMSMC methods over the SSA is ~20 and 3600, respectively.

To compare the accuracy of the multiscale methods in a systematic manner, we generate normalized histograms of the states observed at time t = 50 s using 10000 trajectories. Figure 5 shows that the HyMSMC method accurately captures the statistics of the process.

The above example clearly demonstrates the superiority of the HyMSMC method over the MSMC method. The strength of the hybrid scheme stems from its ability to handle large populations that typically render SSA solvers inefficient. The hybrid solver quickly relaxes the fast network, even when the pre-relaxation state is far away from the QE. The computational advantage of the hybrid solver, over the SSA, in relaxing the fast network, is demonstrated by comparing the convergence of the slow-scale propensities in Figure 6. The QE PDF of the fast species B is given by the binomial PDF, $ℬ$ (p,N), where p = k₁/(k₁ + k_-1) and N = (X_A + X_B). The slow-scale propensity of the linear reaction, B → C, is analytically given by k2Np (see dashed line in Figure 6).

Using this network as an example, we also studied the efficiency of the MSMC and the HyMSMC methods as a function of population scales, at a constant time scale separation. Studying the effect of population under overall equilibrium conditions ensures that the net speedup corresponds to a specific average population. The population scales were varied from O(1) to O(10⁶), and in each case we ran the simulation until a certain time and monitored the CPU time and the number of MC events required to complete the run. The parameters in 5 cases are presented in Table 1. The time scale separation is maintained at around 10⁴ in all cases.

Table 1 Rate constants and initial populations in system (Al) used to maintain the time scale separation at ~O(10⁴) as the population X₀ increases.

Full size table

The cost of relaxing the faster network, gauged by the average number of relaxation events per slow event (Figure 7b), increases with increasing population. As a result, the speedup of the MSMC method over the SSA decreases with increasing population (see Figure 7a).

In contrast to the MSMC method, the speedup achieved with the HyMSMC method improves with increasing population. At low populations, the HyMSMC speedup approaches that of the MSMC technique, because the hybrid solver reduces to the SSA at these population levels that are below the critical population limit. At the other extreme, as the population increases, the number of slow events required to reach a certain time approaches 1 (see Figure 7). As a result, the CPU time and the number of MC events required to reach this end time reduce to the requirements for one slow event. Thus, in the limit of high population, the speedup of the HyMSMC over the exact SSA tends to plateau off. By increasing the end time used for the speedup measurement, say from 10⁴ s to 10⁵ s, the leveling of the HyMSMC speedup curve occurs at a higher population (see Figure 7a). In theory, the speedup of the HyMSMC method has a power law dependence on the population scale.

Using the same network, we also studied the effect of network stiffness on the speedup achieved at a constant population. The parameters used for this study are presented in Table 2. The network stiffness is approximated by the ratio of the rate constants, O(k₁/k₂), given that the populations of all species are in the same range (~500). Figure 8 shows that the acceleration achieved with both multiscale schemes has a power law dependence on stiffness. The speedup of the HyMSMC method lies above that of the MSMC method, highlighting the additional speedup achieved through temporal coarse- graining. As the network stiffness reduces to below 10³, the speedup of the MSMC drops below unity, implying that the MSMC method is more expensive than the SSA, and hence should not be used. This is caused by the cost of the MSMC method associated with relaxing the fast network. However, even with this small time-scale separation in the system, the HyMSMC still gives a speedup of around 30 over the SSA due to temporal coarse-graining.

Table 2 Rate constants versus network stiffness in system (A1).

Full size table

Finally, we demonstrate the use of CSP to aid network partitioning. The deterministic dynamics of the coupled isomerization network (Al) can be written as

\begin{array}{l} g_{1} = \frac{d X_{A}}{d t} = - k_{1} X_{A} + k_{- 1} X_{B} \\ g_{2} = \frac{d X_{B}}{d t} = k_{1} X_{A} - (k_{- 1} + k_{2}) X_{B} + k_{- 2} X_{C} \\ g_{3} = \frac{d X_{C}}{d t} = k_{2} X_{B} - k_{- 2} X_{C} . \end{array}

(A2)

Since the system is linear in x, the Jacobian is time-invariant, and thus its eigenvalues and eigenvectors do not evolve with time. The eigenvalues of the Jacobian matrix are Λ₁₁ ≈ -200, Λ₂₂ ≈ -0.015, and Λ₃₃ = 0. The relaxation time of the fast network approximated from the largest in magnitude eigenvalue, (~|5/Λ₁₁|~2.5 × 10^-2), is consistent with the time estimated from the statistical convergence criterion over the course of a simulation (~ 2.5 × 10^-2). The corresponding eigenvectors are

b_{1}^{*} = (\begin{matrix} 0.7071 \\ - 0.7071 \\ 0 \end{matrix}); b_{2}^{*} = (\begin{matrix} 0.4083 \\ 0.4082 \\ - 0.8165 \end{matrix}); b_{3}^{*} = (\begin{matrix} - 0.5774 \\ - 0.5774 \\ - 0.5774 \end{matrix})

(A3)

and the inverse eigenvectors are

\begin{array}{l} b_{1} = [\begin{matrix} 0.7071 & - 0.7071 & 3.5 \times 10^{- 5} \end{matrix}] \\ b_{2} = [\begin{matrix} 0.4083 & 0.4083 & - 0.8165 \end{matrix}] \\ b_{3} = [\begin{matrix} - 0.5774 & - 0.5774 & - 0.5774 \end{matrix}] \end{array}

(A4)

Based on the magnitude of the eigenvalues, mode 1 is the fast mode. The participation indices of all the reactions in the fast mode are evaluated using (9), and are shown in Figure 9. At small times, when the QE approximation is not yet valid, only reaction 1 is identified as a fast reaction. Then, as the population of X_B builds up, the contribution of reaction 2 to the fast mode increases, and eventually reactions 1 and 2 contribute almost equally to the fast mode. Thus, the partitioning of the networks with the CSP analysis agrees with that based on the propensities.

b. A System with Bistability in the Fast Network

Bistability is a common feature of biological networks, and thus, it is important that the approximate stochastic algorithms capture the correct dynamics of a bistable system. One of the most studied examples of bistabilty is the Schlögl model

\begin{array}{r} X_{1} + 2 X ⇄_{k_{- 1}}^{k_{1}} 3 X \\ X_{2} ⇄_{k_{- 2}}^{k_{2}} X . \end{array}

(B1)

The simplicity of this network makes it an ideal prototype to study the validity of the algorithm in the presence of bimodality. In the above network, we assume that species X₁ and X₂ are in excess and act as buffering species. Consequently, the population of species X₁ and X₂ can be combined with the rate constants k₁ and k₂, to give lumped rate constants ${k^{'}}_{1}$ and ${k^{'}}_{2}$ , respectively. The network can be written as

\begin{matrix} 2 X ⇄_{k_{- 1}}^{{k^{'}}_{1}} 3 X & ; & {k^{'}}_{1} = X_{1} k_{1} = 144 {(m o l c - s)}^{- 1}; k_{1} = 6 {(m o l c^{2} - s)}^{- 1} \\ ⇄_{k_{- 2}}^{{k^{'}}_{2}} X & ; & {k^{'}}_{2} = X_{2} k_{2} = 5000 m o l c / s; k_{2} = 1276 s^{- 1} . \end{matrix}

(B2)

At equilibrium, the network displays a noise-induced switching between the two stable steady states. A typical differential equation solver will relax the above network to either of its stable equilibrium states, depending on the initial state. The deterministic steady states can be obtained by solving the cubic steady-state equation for X

\frac{d X}{d t} = \frac{{k^{'}}_{1} X (X - 1)}{2} - \frac{k_{- 1} X (X - 1) (X - 2)}{6} + {k^{'}}_{2} - k_{- 2} X = 0.

(B3)

The reaction rates in (B3) are based on the discrete nature of the reacting species. Solving (B3) for X gives $X_{E q m}^{1}$ = 5, $X_{E q m}^{2}$ = 20 and $X_{E q m}^{3}$ = 50 as the steady state solutions of (B2) with $X_{E q m}^{1}$ and $X_{E q m}^{3}$ being stable and $X_{E q m}^{2}$ being unstable. The equilibrium solution of the above steady state rate equation, using a generic algebraic solver, such as the Newton's method, depends on the initial guess.

The Schlögl model does not display any separation of time scales. So to apply the hybrid multiscale scheme to this example, the Schlögl model was coupled with a pair of slow reversible isomerization reactions

\begin{matrix} 2 X ⇄_{k_{- 1}}^{{k^{'}}_{1}} 3 X & ; & {k^{'}}_{1} = X_{1} k_{1} = 144 {(m o l c - s)}^{- 1}; k_{1} = 6 {(m o l c^{2} - s)}^{- 1} \\ ⇄_{k_{- 2}}^{{k^{'}}_{2}} X & ; & {k^{'}}_{2} = X_{2} k_{2} = 5000 m o l c / s; k_{2} = 1276 s^{- 1} \\ X ⇄_{k_{- 3}}^{k_{3}} Y & ; & k_{3} = 0.0005 s^{- 1}; k_{- 3} = 0.0001 s^{- 1} \end{matrix} .

(B4)

Thus, in our example, the Schlögl model constitutes the fast network, and the solution of Eq. (B3) is the deterministic QE solution of the fast network. The presence of bimodality in the fast network introduces interesting features in the transfer of information between the fast and the slow networks. Figure 10a and Figure 10b show the population trajectories for species X and Y, using the HyMSMC method and the SSA, respectively. Visually, the HyMSMC technique seems to capture the temporal behavior of the network, including the bistability of the species X. The SSA trajectory appears noisier than the HyMSMC one because it records the numerous fast events that are neglected in the HyMSMC plot. Only the states observed at every coarse time step have been recorded in the HyMSMC plot, compared to every 5 millionth point in the SSA trajectory. Based on the CPU time required for a single run, the HyMSMC and MSMC speedups over the SSA are 9000 and 2800, respectively. Based on MC events, the speedups are ~5000 and 600, respectively. As a result of the moderate size of populations in the network, the HyMSMC method has a rather marginal (~3 in CPU time and ~5 in MC events) advantage over the MSMC method.

To assess the accuracy of the HyMSMC technique, we compared the PDFs at 1000 s from 10000 trajectories, with those of the SSA. As was mentioned earlier, by selecting the last state at convergence as the representative state at the macroscopic time step, we are implicitly selecting the fast state from the frequency PDF (see Appendix A) present at that macroscopic time. For an accurate histogram of the fast species at any time, their states need to be sampled from the temporal PDF (see Appendix A) at any state of the slow network. In most cases, sampling from the frequency PDF does not cause any noticeable errors in the histogram of the fast species. However, these sampling-induced discrepancies surface in circumstances involving low populations and/or multimodality. This network exemplifies such a situation. Even though the statistics of the slow species Y is correctly captured (open triangles in Figure 11b), there is a distinct mismatch with the SSA results for the fast species (see filled triangles in Figure 11a). It is possible to obtain the correct probabilistic information of the fast network at any time, provided the slow network has been evolved correctly. To demonstrate this, we performed additional sampling of the fast network to obtain the temporal PDF. We then compute the average of the temporal PDFs at 1000 s using 10000 trajectories to account for the joint probability of observing a slow state, x^s, and the relaxed PDF, PDF (x^s), of the fast species at that slow state. Using this simple procedure, the PDFs of both the species match with those obtained using the SSA (see Figure 11 a, open triangles). This corroborates the fact that QE information of the fast network is embedded in the evolution of the slow species, and can be extracted by sufficient time-average sampling when needed. Usually, we do not have an a priori knowledge of the differences between the temporal and the frequency PDFs. Therefore, we propose that only the temporal PDF should be used.

We further illustrate the role of the correlations of the fast network, in the evolution of the slow network. In the slow-scale SSA (SS-SSA), Cao et al. [13] used the mean-field approximation of the relaxed fast network to evaluate the slow-scale propensity. In the present example, the fast network has 3 steady-state solutions that are independent of the slow network, due to the buffering species in the Schlögl model. As a result, the deterministic equilibrium solutions, X_Eqm, of the fast network are the same at every slow step. Assuming that the same initial guess is given to the solver at every macroscopic time step, the bistability of the fast species will not be identified by the SS-SSA. This error creeps into the slow species evolution too. This can be seen in Figure 11b (filled symbols), where using X_Eqm = 5 or X_Eqm = 50 to evaluate the slow-scale propensities in the SS-SSA results in a wrong PDF of the slow species. This example emphasizes the need for accurate transfer of information (stochastic closure) between scales, and the failure of the mean-field approximation of the slow-scale propensities for bistable systems.

The CSP analysis is based on the analytical Jacobian of the following set of ODEs

\begin{array}{l} g_{1} = \frac{d X}{d t} = \frac{{k^{'}}_{1} X (X - 1)}{2} - \frac{k_{- 1} X (X - 1) (X - 2)}{6} + {k^{'}}_{2} - k_{- 2} X - k_{3} X + k_{- 3} Y \\ g_{2} = \frac{d Y}{d t} = k_{3} X - k_{- 3} Y . \end{array}

(B5)

The two eigenvalues are well separated, indicative of a significant scale separation in the network. Specifically, |Λ₂₂| is ~10^-4 and |Λ₁₁| fluctuates between negative and positive values (this is characteristic of sampling states from the unstable attractor as the system transitions from one branch to the other) but is of the order of 10²-10³. The temporal profile of the participation indices in the fast mode is presented in Figure 12. The combined participation of reactions 5 and 6 is negligible at all times. The individual contribution of reactions 1 to 4 fluctuates a lot, but their combined participation to the fast mode equals ~1, see inset in Figure 12, implying that the fast mode is comprised of the first four reactions. Thus, the partitioning of the two methods (propensity and CSP based) is consistent.

Assessment of convergence in this bistable system is more complicated. The relaxation time estimated via CSP provides the time needed to sample a branch. For bistable systems, aside from two relaxation times, there is also a time scale related to the (residence) time the system stays in a branch. In order to adequately sample both branches, one needs to simulate the system for times longer than all three time scales. The mean residence time of the fast network was estimated to be ~0.01 s on the higher branch (X ~50) and ~0.04 s on the lower branch (X ~5). The relaxation time of the fast network based on magnitude of the eigenvalues lies in the range O(10^-2)-O(10^-3). Thus, in this case, due to the comparable magnitude of residence times and relaxation times, the relaxation time based on the largest in magnitude eigenvalue gives a reasonable estimate of the sampling time of the fast network. In our simulation, we simulated the fast network for an additional 0.1 s, which was decided as a multiple of the sum of the residence times on each of the bifurcation branches.

c. Gene Expression Model: Bistability in the Slow Network

Having established the benefits of the algorithms with simple networks, we focus on real biological networks. The positive feedback-regulated gene expression model proposed in the work of Kepler and Elston [1] mathematically proved that fluctuations between operator states are a potential source of stochasticity in the level of translated protein, even at high mean populations. The gene expression model used by Kepler and Elston is an excellent biological example of "multiscales plaguing stochastic simulation". Unlike the previous example where the bistability was present in the fast network, in this gene expression model, the bistability is rooted in the slow network, and propagates to the fast network.

In the gene expression model shown below, the dimerized form, D, of the gene product, M, activates the gene in a positive feedback manner, by binding its operator site.

\begin{matrix} 2 M ⇄_{k_{- 1}}^{k_{1}} D & k_{1} = 100 {(m o l c - s)}^{- 1}; k_{- 1} = 10^{6} s^{- 1} \\ θ_{0} + D ⇄_{k_{- 2}}^{k_{2}} θ_{1} & k_{2} = 0.15 {(m o l c - s)}^{- 1}; k_{- 2} = 1.5 s^{- 1} \\ θ_{0} \overset{k_{3}}{\to} θ_{0} + M & k_{3} = 50 s^{- 1} \\ θ_{1} \overset{k_{4}}{\to} θ_{1} + M & k_{4} = 1000 s^{- 1} \\ M \overset{k_{5}}{\to} & k_{5} = 1 s^{- 1} . \end{matrix}

(C1)

The separation of scales comes about because the dimerization reaction and its reverse reaction proceed at a much faster rate than the remaining reactions in the network. Hence, in the simulations performed using the exact SSA, most of the simulation time is spent firing the first set of reversible reactions. In simulating 1000 seconds of real time, the HyMSMC algorithm accelerates the simulation by a factor of 140, based on CPU times, and by a factor of 400, based on the number of MC events. The trajectories for the gene product, M, and its dimer, D, are in good visual agreement with those generated using the SSA, as shown in Figure 13 and Figure 14.

PDFs generated at t = 10 s using the HyMSMC method are in excellent agreement with those of the exact SSA (Figure 15). The low population of the dimer exemplifies a situation where the temporal PDF of the fast network is different from the frequency PDF. As a result, the PDF of the dimer species, generated using the HyMSMC method is in disagreement with the SSA PDF at low population states. However, sampling (t_samp = 0.001 s) the fast network at 10 s to collect the temporal PDF, followed by the ensemble average eliminates this discrepancy (see inset in Figure 15b).

As an additional test of accuracy of the HyMSMC method, we compared the distributions of the life-time of the states of the gene (θ₀ = 0 and θ₀ = 1) obtained from a single equilibrium run of the HyMSMC method and the SSA. Figure 16 shows that the HyMSMC method correctly captures the PDF of life time of both states.

Finally, the CSP-analysis shows that there is one predominant eigenvalue (|Λ₁₁| ~10⁶) at all times, implying the presence of one fast mode. The microsolver relaxes the fast network in ~10^-4 time units based on the statistical criterion, which is greater than the eigenvalue-based relaxation time of 5 × l0^-6, implying that our statistical relaxation criterion is more conservative and dominates convergence in this example. The contribution of the reactions to the fast mode is shown in Figure 17. The fast mode almost completely consists of reactions 1 and 2 (see inset in Figure 17). However, the contribution of reaction 2 drops to zero when the D = 0 state is encountered. In such a situation, the CSP-identified fast network will consist of only reaction 1. Even the rank based partitioning faces similar problems in such a situation. One way to tackle the problem is to base the partitioning of the network on average propensity values evaluated from a few (depending on network size) MC events. These averaged propensities could effectively be used to perform the CSP analysis.

d. Heat Shock Response (HSR) Model

As a final example, we apply the algorithm to the heat shock response model in E. Coli. On sensing high temperature, the cell triggers off a network of biochemical reactions at the genetic and cytoplasmic level to prevent or reduce protein denaturation caused by elevated temperatures. The physiological behavior resulting from these reactions is termed the heat shock response (HSR). Takahashi et al. [29] and E et al. [20] used a modified version of the HSR model as an example of a biological network with multiple time scales. The biochemical reactions constituting the HSR model are presented in Table 3. A large separation of time scales is present in this network, with the first 3 reactions shown in Table 3 being around six orders of magnitude faster than the rest. We applied the HyMSMC scheme to the HSR network.

Table 3 Reaction network of the heat shock response model.

Full size table

The transient behavior of the network is captured accurately using the HyMSMC, as can be seen in Figure 18 and Figure 19 which compares trajectories of two species with those of the SSA. Species σ³² and DnaJ were chosen from the fast and slow network, respectively, to demonstrate the accuracy of the algorithm at all scales. Using the HyMSMC method requires 6 s (7 × l0⁵ MC events) to reach t = 300, compared to 1200 s (9 × 10⁸MC events) required by SSA, thus accelerating the simulation by a factor of 200 (1000 based on MC events). As a test of accuracy, the normalized histograms of states at t = 1 s from 1000 trajectories were generated. The results are in good agreement, as shown in Figure 20. The algorithm accurately captures the noise of species σ³² despite the low population involved (see Figure 20b).

The CSP analysis identifies one fast mode in the HSR network, whose eigenvalue is 3 orders of magnitude larger than the next fastest mode. The relaxation time estimated from the eigenvalues (~2.5 × l0^-3) is in good agreement with the one based on the statistical convergence criterion (order of ~1.2 × l0^-3). The temporal profile of the participation indices, shown in Figure 21, identifies the first 3 reactions in the network as the fast reactions that contribute the most to the fastest mode. The combined participation index of reactions 4 to 17 is negligible compared to the participation index of reactions 1 to 3. Thus, reactions 4–17 are the slow reactions. Figure 21 also shows that for the time considered, the network partitioning is maintained. Thus, even for a moderately large network, the CSP method offers a systematic method to partition the network. For the HSR model, the CSP analysis increased the CPU cost of the HyMSMC method by less than 4%. Thus, for the largest network simulated in this paper, the CPU overhead of the CSP-assisted partitioning method is negligible.

Conclusion

In this paper, we have presented a hybrid, multiscale Monte Carlo (HyMSMC) algorithm that can simultaneously handle disparity of timescales and species population in well-mixed stochastic networks. The HyMSMC method uses a hybrid SSA-i-leap method [28] as the microscopic and the macroscopic solvers, and employs stochastic singular perturbation concepts when time scale separation exists. As a result, it is able to systematically exploit large populations in the fast and/or slow networks to accelerate stochastic simulations without losing accuracy. Importantly, the absence of a priori assumptions about the scales in the network makes the HyMSMC method applicable over a wide range of the population and time-scale separation. Consequently, the HyMSMC method can switch between the exact SSA [3, 4], the τ-leap [6], or the MSMC [17] method in a dynamic manner, depending on the instantaneous timescales and populations in the network.

The superiority of the HyMSMC method over the original MSMC method [17] and the SSA [3, 4] was demonstrated using various examples, including two networks displaying bistability. The core of most multiscale algorithms is the closures between scales, which strongly influence accuracy. Using the Schlögl model and a simple gene expression model, we showed that the closures used in the HyMSMC method accurately capture the bimodality in the fast and the slow networks.

A second contribution of this paper is the stochastic partitioning and convergence that have not extensively been discussed in previous work. We introduced the computational singular perturbation (CSP) technique [25, 26] as a diagnostic tool to systematically identify the fast and the slow networks. In all examples, CSP was able to correctly partition the network and provide estimates of relaxation time for converging the fast network. This latter use of eigenvalues in conjunction with statistical tests is strongly recommended in order to eliminate numerical artifacts of the latter. The computational overhead associated with the evaluation of the Jacobian matrix and the eigenvalue analysis has been small for the examples studied here.

Appendix A: Evaluation of PDFs and Slow Scale Propensities

Calculation of PDFs

Once a network has been partitioned, one needs to relax the fast sub-network (microscopic solver) until the stochastic low-dimensional manifold has been reached, then compute the QE PDF of the fast network, and subsequently use it to evolve the slow network. In our original MSMC method [17], this was accomplished using the SSA but this can also be done with the τ-leap method. We compute the time-averaged PDF of the fast states

P_{\infty} (x^{f} | x) \approx \frac{Δ t^{f}}{τ_{E q m}},

(AP1)

where Δt^f is the total life time of the state x^fin an equilibrium simulation of the fast network for time $τ_{E q m} = \sum_{x^{f^{'}}} Δ t^{f^{'}}$ , given that the slow system is at state x^s. We obtain Δt^f by monitoring the life-time of states accessed once at equilibrium.

In the literature, it is common to generate a PDF based on the frequency of observing a state

P_{\infty} (x^{f} | x) \approx \frac{M C E^{f}}{M C E_{E q m}} .

(AP2)

Here, MCE^f is the number of times a state x^fis observed in an equilibrium simulation of MCE_Eqm MC events of the fast network. We refer to the PDFs based on the life-time of states (Eq. (1)) and frequency of states (Eq. (2)) as the temporal and frequency PDF, respectively. In most cases, the frequency and temporal PDFs are practically the same. However, differences arise when the QE PDFs are multimodal and/or when low populations are encountered, as was shown in Ref. [17]. Numerical examples of this paper further indicate that the frequency PDF method can result in erroneous results in such cases, so we strongly recommend the use of the temporal PDF.

Calculation of Slow-Scale Propensities

The propensity function of the slow reaction $R_{j}^{s}$ , given by $a_{j}^{s}$ (x^f,x^s), is a function of the states of the slow and the fast species. Thus, the distribution of X^fresults in a distribution of $a_{j}^{s}$ (x^f,x^s) for all j ∈ $R_{j}^{s}$ . The transition probabilities used to evolve the slow network should be projected onto the QE PDF of the fast network. In Ref. [17], we underscored that the most rigorous way of evolving the slow processes should be based on a joint PDF that accounts for the life time of various fast states along with the rate at which these states are being changed by slow processes.

Evaluating the first moment of the propensity, $a_{j}^{s}$ (x^f,x^s), over the temporal QE PDF, P_∞(x^f| x), is a simpler way to assimilate the stochastic QE description into the dynamics of the slow network [13]. The SS-SSA [13] uses an analytical expression of QE-PDF to evaluate Eq. (10), which restricts the applicability of the algorithm to simple networks. Alternatively, it uses a deterministic, steady-state solution of the fast network to evaluate the slow-scale propensities. As correctly pointed out by Cao et al. [13], this mean-field approach neglects the correlation in the stochastic QE and is accurate at large populations only. In a numerical example, we show that the mean-field approach also fails when the fast network has multiple steady states. Other stochastic multiscale methods [16, 17, 20], which employ the slow-scale approximation, obtain the slow-scale propensities via a numerical approximation, and are, therefore, more general than the SS-SSA, at the expense, of course, of increased computational cost.

Here, we follow Ref. [20] and numerically compute the average propensities instead of the entire PDF. However, we ensure that the state we pick to fire from can indeed be fired, i.e., no negative populations arise (see also main text). This approach effectively accounts for the joint PDF approach of Ref. [17]. More specifically, the slow-scale propensity of each slow reaction can be evaluated as

\bar{a_{j}^{s}} (x) = \frac{\sum_{x^{f^{'}}} Δ t^{f^{'}} \cdot a_{j}^{s} (x^{f^{'}}, x^{s})}{τ_{E q m}}, f o r j = 1, 2, ..., m^{s}

(AP3)

Eq. (3) can be rewritten as follows

\bar{a_{j}^{s}} (x) = \frac{\sum_{p = 1}^{M C E_{E q m}^{f}} {(τ^{f} \cdot a_{j}^{s}) |}_{p}}{τ_{E q m}}, f o r j = 1, 2, ..., m^{s}

(AP4)

where $M C E_{E q m}^{f}$ is the number of MC events executed by the microscopic solver to simulate time (after relaxation has been achieved) $τ_{E q m} = \sum_{p = 1}^{M C E_{E q m}^{f}} τ_{p}^{f}$ of the quasi- equilibrated fast network. ${τ^{f} |}_{p}$ is the time increment in the p^th MC event, and ${a_{j}^{s} |}_{p}$ is the propensity of the j^th slow reaction before the occurrence of the p^th MC event.

References

Kepler TB, Elston TC: Stochasticity in transcriptional regulation: Origins, consequences and mathematical representations. Biophys J. 2001, 81: 3116-3136.
Article PubMed Central CAS PubMed Google Scholar
Elowitz MB, Levine AJ, Siggia ED, Swain PS: Stochastic gene expression in a single cell. Science. 2002, 297 (5584): 1183-1186. 10.1126/science.1070919.
Article CAS PubMed Google Scholar
Gillespie DT: A general method for numerically simulating the stochastic evolution of coupled chemical reactions. Journal of Computational Physics. 1976, 22: 403-434. 10.1016/0021-9991(76)90041-3.
Article CAS Google Scholar
Gillespie DT: Exact stochastic simulation of coupled chemical reactions. Journal of Physical Chemistry. 1977, 81: 2340-2361. 10.1021/j100540a008.
Article CAS Google Scholar
Gibson MA, Bruck J: Efficient exact stochastic simulation of chemical systems with many species and many channels. Journal of Physical Chemistry A. 2000, 104: 1876-1889. 10.1021/jp993732q.
Article CAS Google Scholar
Gillespie DT: Approximate accelerated stochastic simulation of chemically reacting systems. Journal of Chemical Physics. 2001, 115 (4): 1716-1733. 10.1063/1.1378322.
Article CAS Google Scholar
Chatterjee A, Vlachos DG, Katsoulakis MA: Binomial distribution based τ-leap accelerated stochastic simulation. Journal of Chemical Physics. 2005, 122 (2): 024112-10.1063/1.1833357.
Article PubMed Google Scholar
Tian T, Burrage K: Binomial leap methods for simulating stochastic chemical kinetics. Journal of Chemical Physics. 2004, 121 (21): 10356-10364. 10.1063/1.1810475.
Article CAS PubMed Google Scholar
Auger A, Chatelain P, Koumoutsakos P: R-leaping: Accelerating the stochastic simulation algorithm by reaction leaps. Journal of Chemical Physics. 2006, 125 (8):
Rathinam M, Petzold LR, Cao Y, Gillespie DT: Stiffness in stochastically reacting systems: The implicit τ-leaping method. Journal of Chemical Physics. 2003, 119 (24): 12784-12794. 10.1063/1.1627296.
Article CAS Google Scholar
Cao Y, Gillespie DT, Petzold LR: Avoiding negative populations in explicit Poisson tau-leaping. Journal of Chemical Physics. 2005, 123 (5): 054104-10.1063/1.1992473.
Article PubMed Google Scholar
Harris LA, Clancy P: A "partitioned leaping" approach for the multiscale modeling of chemical reaction dynamics. Journal of Chemical Physics. 2006, 125: 144107-10.1063/1.2354085.
Article PubMed Google Scholar
Cao Y, Gillespie DT, Petzold LR: The slow-scale stochastic simulation algorithm. Journal of Chemical Physics. 2005, 122 (1): 14116-10.1063/1.1824902.
Article PubMed Google Scholar
Cao T, Gillespie DT, Petzold LR: Multiscale stochastic simulation algorithm with stochastic partial equilibrium assumption for chemically reacting system. Journal of Computational Physics. 2005, 206: 395-411. 10.1016/j.jcp.2004.12.014.
Article CAS Google Scholar
Sails H, Kaznessis Y: Accurate hybrid stochastic simulation of a system of coupled chemical or biochemical reactions. Journal of Chemical Physics. 2005, 122 (5): 54103-10.1063/1.1835951.
Article Google Scholar
Sails H, Kaznessis Y: An equation-free probabilistic steady-state approximation: Dynamics application to the stochastic simulation of biochemical reaction networks. Journal of Chemical Physics. 2005, 123: 214106-10.1063/1.2131050.
Article Google Scholar
Samant A, Vlachos DG: Overcoming stiffness from stochastic simulation stemming from partial equilibrium: a multiscale Monte Carlo algorithm. Journal of Chemical Physics. 2005, 123 (14): 144114-10.1063/1.2046628.
Article CAS PubMed Google Scholar
Rao CV, Arkin AP: Stochastic chemical kinetics and the quasi-steady-state assumption: Application to the Gillespie algorithm. Journal of Chemical Physics. 2003, 118 (11): 4999-5010. 10.1063/1.1545446.
Article CAS Google Scholar
Haseltine EL, Rawlings JB: Approximate simulation of coupled fast and slow reactions for stochastic chemical kinetics. Journal of Chemical Physics. 2002, 117 (15): 6959-6969. 10.1063/1.1505860.
Article CAS Google Scholar
E W, Liu D, Vanden Eijnden E: Nested stochastic simulation algorithm for chemical kinetic systems with disparate rates. Journal of Chemical Physics. 2005, 123: 194107-10.1063/1.2109987.
Article PubMed Google Scholar
Goutsias J: Quasi-equilibrium approximation of fast reaction kinetics in stochastic biochemical systems. Journal of Chemical Physics. 2005, 122 (18): 184102-10.1063/1.1889434.
Article PubMed Google Scholar
Gillespie DT: The chemical Langevin equation. Journal of Chemical Physics. 2000, 113 (1): 297-306. 10.1063/1.481811.
Article CAS Google Scholar
Vanden-Eijnden E: Numerical techniques for multi-scale dynamical systems with stochastic effects. Comm Math Sci. 2003, 1 (2): 385-391.
Article Google Scholar
Chatterjee A, Vlachos DG: Multiscale spatial Monte Carlo simulations: Multigriding, computational singular perturbation, and hierarchical stochastic closures. Journal of Chemical Physics. 2006, 124 (6): 0641101-06411016. 10.1063/1.2166380.
Article Google Scholar
Lam SH: Using CSP to understand complex chemical kinetics. Combustion Science and Technology. 1993, 89 (5–6): 375-404.
Article CAS Google Scholar
Lam SH: The CSP method of simplifying kinetics. International Journal of Chemical Kinetics. 1994, 26: 461-486. 10.1002/kin.550260408.
Article CAS Google Scholar
Valorani M, Goussis DA, Creta F, Najm HN: Higher order corrections in the approximation of low-dimensional manifolds and the construction of simplified problems with the CSP method. Journal of Computational Physics. 2005, 209: 754-786. 10.1016/j.jcp.2005.03.033.
Article Google Scholar
Cao Y, Gillespie DT, Petzold LR: Efficient step size selection for the τ-leaping simulation method. Journal of Chemical Physics. 2006, 124: 044109-10.1063/1.2159468.
Article PubMed Google Scholar
Takahashi K, Kaizu K, Bin H, Tomita M: A multi-algorithm, multi-timescale method for cell simulation. Bioinformatics. 2004, 20 (4): 538-546. 10.1093/bioinformatics/btg442.
Article CAS PubMed Google Scholar

Download references

Acknowledgements

The research was partially supported by the Department of Energy (DE-FG02-04ER25626 and DE-FG02-05ER25702). However, any opinions, findings, conclusions, or recommendations expressed herein are those of the authors and do not necessarily reflect the views of the DOE.

Author information

Authors and Affiliations

Department of Chemical Engineering, University of Delaware, Newark, Delaware, 19716, USA
Asawari Samant, Babatunde A Ogunnaike & Dionisios G Vlachos

Authors

Asawari Samant
View author publications
You can also search for this author in PubMed Google Scholar
Babatunde A Ogunnaike
View author publications
You can also search for this author in PubMed Google Scholar
Dionisios G Vlachos
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Dionisios G Vlachos.

Additional information

Authors' contributions

AS developed the algorithm, set up and performed the simulations and drafted the manuscript. DGV and BAO conceived the study, and participated in its design and coordination and helped refine the manuscript. All authors read and approved the final manuscript.

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Authors’ original file for figure 6

Authors’ original file for figure 7

Authors’ original file for figure 8

Authors’ original file for figure 9

Authors’ original file for figure 10

Authors’ original file for figure 11

Authors’ original file for figure 12

Authors’ original file for figure 13

Authors’ original file for figure 14

Authors’ original file for figure 15

Authors’ original file for figure 16

Authors’ original file for figure 17

Authors’ original file for figure 18

Authors’ original file for figure 19

Authors’ original file for figure 20

Authors’ original file for figure 21

Authors’ original file for figure 23

Authors’ original file for figure 24

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Samant, A., Ogunnaike, B.A. & Vlachos, D.G. A hybrid multiscale Monte Carlo algorithm (HyMSMC) to cope with disparity in time scales and species populations in intracellular networks. BMC Bioinformatics 8, 175 (2007). https://doi.org/10.1186/1471-2105-8-175

Download citation

Received: 13 February 2007
Accepted: 24 May 2007
Published: 24 May 2007
DOI: https://doi.org/10.1186/1471-2105-8-175

A hybrid multiscale Monte Carlo algorithm (HyMSMC) to cope with disparity in time scales and species populations in intracellular networks

Abstract

Background

Results

Conclusion

Background

Results and Discussion

Stochastic simulation algorithm (SSA) Notation

Stochastic Quasi-Equilibrium, Network Partitioning, Relaxation, and Evolution

a. Quasi-equilibrium (QE) in stochastic systems

b. Partitioning of a stiff stochastic system

Computational Singular Perturbation (CSP)-assisted stochastic partitioning

c. Evolution of the slow network

d. Relaxation of the Fast Network

The Hybrid Multiscale Monte Carlo (HyMSMC) Method

a. Hybrid Solver as the Microscopic Solver

b. Hybrid Solver as the Macroscopic Solver

Algorithm Implementation

a. Microscopic Solver – Fast Network

b. Macroscopic Solver – Slow Network

Algorithm Testing

a. Coupled Isomerization Reactions

b. A System with Bistability in the Fast Network

c. Gene Expression Model: Bistability in the Slow Network

d. Heat Shock Response (HSR) Model

Conclusion

Appendix A: Evaluation of PDFs and Slow Scale Propensities

Calculation of PDFs

Calculation of Slow-Scale Propensities

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Authors' contributions

Authors’ original submitted files for images

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Bioinformatics

Contact us