Bayesian inference of biochemical kinetic parameters using the linear noise approximation

Komorowski, Michał; Finkenstädt, Bärbel; Harper, Claire V; Rand, David A

doi:10.1186/1471-2105-10-343

Methodology article
Open access
Published: 19 October 2009

Bayesian inference of biochemical kinetic parameters using the linear noise approximation

Michał Komorowski^1,2,
Bärbel Finkenstädt¹,
Claire V Harper⁴ &
…
David A Rand^2,3

BMC Bioinformatics volume 10, Article number: 343 (2009) Cite this article

8194 Accesses
81 Citations
Metrics details

Abstract

Background

Fluorescent and luminescent gene reporters allow us to dynamically quantify changes in molecular species concentration over time on the single cell level. The mathematical modeling of their interaction through multivariate dynamical models requires the deveopment of effective statistical methods to calibrate such models against available data. Given the prevalence of stochasticity and noise in biochemical systems inference for stochastic models is of special interest. In this paper we present a simple and computationally efficient algorithm for the estimation of biochemical kinetic parameters from gene reporter data.

Results

We use the linear noise approximation to model biochemical reactions through a stochastic dynamic model which essentially approximates a diffusion model by an ordinary differential equation model with an appropriately defined noise process. An explicit formula for the likelihood function can be derived allowing for computationally efficient parameter estimation. The proposed algorithm is embedded in a Bayesian framework and inference is performed using Markov chain Monte Carlo.

Conclusion

The major advantage of the method is that in contrast to the more established diffusion approximation based methods the computationally costly methods of data augmentation are not necessary. Our approach also allows for unobserved variables and measurement error. The application of the method to both simulated and experimental data shows that the proposed methodology provides a useful alternative to diffusion approximation based methods.

Background

The estimation of parameters in biokinetic models from experimental data is an important problem in Systems Biology. In general the aim is to calibrate the model so as to reproduce experimental results in the best possible way. The solution of this task plays a key role in interpreting experimental data in the context of dynamic mathematical models and hence in understanding the dynamics and control of complex intracellular chemical networks and the construction of synthetic regulatory circuits [1]. Among biochemical kinetic systems, the dynamics of gene expression and of gene regulatory networks are of particular interest. Recent developments of fluorescent microscopy allow us to quantify changes in protein concentration over time in single cells (e.g. [2, 3]) even with single molecule precision (see [4] for review). Therefore an abundance of data is becoming available to estimate parameters of mathematical models in many important cellular systems.

Single cell imaging techniques have revealed the stochastic nature of biochemical reactions (see [5] for review) that most often occur far from thermodynamic equilibrium [6] and may involve small copy numbers of reacting macromolecules [7]. This inherent stochasticity implies that the dynamic behaviour of one cell is not exactly reproducible and that there exists stochastic heterogeneity between cells. The disparate biological systems, experimental designs and data types impose conditions on the statistical methods that should be used for inference [8–10]. From the modeling point of view the current common consensus is that the most exact stochastic description of the biochemical kinetic system is provided by the chemical master equation (CME) [11]. Unfortunately, for many tasks such as inference the CME is not a convenient mathematical tool and hence various types of approximations have been developed. The three most commonly used approximations are [12]:

1.
The macroscopic rate equation (MRE) approach which describes the thermodynamic limit of the system with ordinary differential equations and does not take into account random fluctuations due to the stochasticity of the reactions.
2.
The diffusion approximation (DA) which provides stochastic differential equation (SDE) models where the stochastic perturbation is introduced by a state dependant Gaussian noise.
3.
The linear noise approximation (LNA) which can be seen as a combination because it incorporates the deterministic MRE as a model of the macroscopic system and the SDEs to approximatively describe the fluctuations around the deterministic state.

Statistical methods based on the MRE have been most widely studied [8, 13–15]. They require data based on large populations. The main advantages of this method are its conceptual simplicity and the existence of extensive theory for differential equations. However, single cells experiments and studies of noise in small regulatory networks created the need for statistical tools that are capable to extract information from fluctuations in molecular species. Few methods used CME to address this. Algorithm, proposed by [16], approximated the likelihood function, the other, suggested by [17] simulated it using Monte Carlo methods. Recently, also a method based on the exact likelihood [18] has been developed. Although, substantial progress has been made in numerical methods for solving CME, inference algorithms based on the CME are computationally intensive and difficult to apply to problems of realistic size and complexity [19]. Another group of methods is based on the DA [9, 20]. This uses likelihood approximation methods (e.g. [21]) that are computationally intensive and require sampling from high dimensional posterior distributions. Inference about the volatility process becomes difficult for low frequency data that are not directly measured at the molecular level [10, 20]. The aim of this study is to investigate the use of the LNA as a method for inference about kinetic parameters of stochastic biochemical systems. We find that the LNA approximation provides an explicit Gaussian likelihood for models with hidden variables and measurement error and is therefore simpler to use and computationally efficient. To account for prior information on parameters our methodology is embedded in the Bayesian paradigm. The paper is structured as follows: We first provide a description of the LNA based modeling approach and then formulate the relevant statistical framework. We then study its applicability in four examples, based on both simulated and experimental data, that clarify principles of the method. Additional file 1 contains details of mathematical and statistical modeling, particularly comparison of the proposed method with an algorithm based on the DA.

Methods

The chemical master equation (CME) is the primary tool to model the stochastic behaviour of a reacting chemical system. It describes the evolution of the joint probability distribution of the number of different molecular species in a spatially homogeneous, well stirred and thermally equilibrated chemical system [11].

Even though these assumptions are not necessarily satisfied in living organisms the CME is commonly regarded as the most realistic model of biochemical reactions inside living cells. Consider a general system of N chemical species inside a volume Ω and let X = (X₁,..., X_N)^Tdenote the number and x = X/Ω the concentrations of molecules. The stoichiometry matrix S = {S_ij}_{i = 1,2...N; j = 1,2...R}describes changes in the population sizes due to R different chemical events, where each S_ijdescribes the change in the number of molecules of type i from X_ito X_i+ S_ijcaused by an event of type j. The probability that an event of type j occurs in the time interval [t, t + dt) equals (x, Ω, t)Ωdt. The functions (x, Ω, t) are called mesoscopic transition rates. This specification leads to a Poisson birth and death process where the probability h(X, t) that the system is in the state X at time t is described by the CME [12] which is given in Additional file 1. It is straightforward to verify that the first order terms of a Taylor expansian of the CME in powers of are given by the following MRE

(1)

where ϕ_i= lim_{Ω→∞, X→∞}X_i/Ω, φ = (ϕ₁,..., ϕ_N)^Tand .

Including also the second order terms of this expansion produces the LNA

(2)

which decomposes the state of the system into a deterministic part φ as solution of the MRE in (1) and a stochastic process ξ described by an Itô diffusion equation

(3)

where W(t) denotes R dimensional Wiener process, and f_i= f_i(φ) (see Additional file 1 for derivation).

The rationale behind the expansion in terms of is that for constant average concentrations relative fluctuations will decrease with the inverse of the square root of volume [22]. Therefore the LNA is accurate when fluctuations are sufficiently small in relation to the mean (large Ω). Hence, the natural measure of adequacy of the LNA is the coefficient of variation i.e. ratio of the standard deviation to the mean (see Additional file 1). Validity of this approximation is also discussed in details in [22, 23]. In addition it can be shown that the process describing the deviation from the deterministic state converges weakly to the diffusion (3) as Ω → ∞ [24]. In order to use the LNA in a likelihood based inference method we need to derive transition densities of the process x.

Transition densities

The LNA provides solutions that are numerically or analytically tractable because the MRE in (1) can be solved numerically and the linear SDE in (3) for an initial condition ξ(t_i) = has a solution of the form [25]

(4)

where the integral is in the Itô sense and (s) is the fundamental matrix of the non-autonomous system of ODEs

(5)

The Itô integral of a deterministic function is a Gaussian random variable [26], therefore equations (4), (5) imply that the transition densities of the process ξ are Gaussian [26] (throughout the paper we use 'Gaussian' or 'normal' shortly to denote either a univariate or a multivariate normal distribution depending on the context)

(6)

where Θ denotes a vector of all model parameters, ψ(·|μ_i-1, Ξ_i-1) is the normal density with mean μ_i-1and covariance matrix Ξ_i-1specified by

(7)

It follows from (2) and (6) that the transition densities of x are normal

(8)

The properties of the normal distribution allow us to construct a convenient inference framework that is reminiscent of the Kalman filtering methodology (see e.g. [27]).

Inference

It is rarely possible to observe the time evolution of all molecular components participating in the system of interest [28]. Therefore, we partition the process x_tinto those components y_tthat are observed and those z_tthat are unobserved.

Let , and denote the time-series that comprise the values of processes x, y and z, respectively, at times t₀,..., t_n. Here and throughout the paper we use the same letter to denote the stochastic process and its realization.

Our aim is to estimate the vector of unknown parameters Θ from a sequence of measurements . The initial condition φ(t₀) is parameterized as an element of Θ. Given the Markov property of the process x the augmented likelihood P(, |Θ) is given by

(9)

where are Gaussian densities specified in (8), and is an initial density assumed to be normal for mathematical convenience. It can then be shown that (see Additional file 1) is Gaussian. Therefore

(10)

where φ(·|φ(t₀),..., φ(t_n), ) is Gaussian density with mean vector (φ(t₀),..., φ(t_n)) and covariance matrix whose elements can be calculated numerically in a straightforward way (see Additional file 1). Since the marginal distributions are also Gaussian it follows that the likelihood function P(|Θ) can be obtained from the augmented likelihood (10)

(11)

where the covariance matrix Σ = {Σ^{(i, j)}}_{i, j = 0,..., n}is a sub-matrix of such that and φ_yis the vector consisting of the observed components of φ.

Fluorescent reporter data are usually assumed to be proportional to the number of fluorescent molecules [29] and measurements are subject to measurement error, i.e. errors that do not influence the stochastic dynamics of the system. We therefore assume that instead of the matrix our data have the form . The parameter λ is a proportionality constant (it is straightforward to generalize for the case with different proportionality constants for different molecular components) and denotes a random vector for additive measurement error. For mathematical convenience we assume that the joint distribution of the measurement error is normal with mean 0 and known covariance matrix Σ_ϵ, i.e. . If measurement errors are independent with a constant variance then . Equation (11) implies that the likelihood function can be written as

(12)

Since for given data the likelihood function (12) can be numerically evaluated, any likelihood based inference is straightforward to implement. Using Bayes' theorem, the posterior distribution P(Θ|) satisfies the relation [30]

(13)

We use the standard Metropolis-Hastings (MH) algorithm [30] to sample from the posterior distribution in (13).

Results and Discussion

In order to study the use of the LNA method for inference we have selected four examples which are related to commonly used quantitative experimental techniques such as measurements based on reporter gene constructs and reporter assays based on Polymerase Chain Reaction (e.g. RT-PCR, Q-PCR). For expository reasons, all case studies consider a model of single gene expression.

Model of single gene expression

Although gene expression involves various biochemical reactions it is essentially modeled in terms of only three biochemical species (DNA, mRNA, protein) and four reaction channels (transcription, mRNA degradation, translation, protein degradation) [31–33]. The stoichiometry matrix has the form

(14)

where rows correspond to molecular species and columns to reaction channels. Let x = (r, p) denote concentrations of mRNA and protein, respectively. For the reaction rates

(15)

we can derive the following macroscopic rate equations

(16)

For the general case it is assumed that the transcription rate k_R(t) is time-dependent, reflecting changes in the regulatory environment of the gene such as the availability of transcription factors or chromatin structure.

Using (14), (15) and (16) in (3) we obtain the following SDEs describing the deviation from the macroscopic state (see section 3.1.4 of Additional file 1 for derivation)

(17)

We will refer to the model in (16) and (17) as the simple model of single gene expression.

In order to test our method on a nonlinear system we will also consider the case of an autoregulated network where the transcription rate of the gene is a function of the concentration of the protein that the gene codes for and where the protein is a transcription factor that inhibits the production of its own mRNA. This is parameterized by a Hill function [31] where k_R(t) now describes the maximum rate of transcription, H is a dissociation constant and n_His a Hill coefficient.

Thus, the nonlinear autoregulatory model the system is described by the MRE

(18)

and the SDEs

(19)

where . We refer to this model as the autoregulatory model of single gene expression. The two models constitute the basis of our inference studies below.

Inference from fluorescent reporter gene data for the simple model of single gene expression

To test the algorithm we first use the simple model of single gene expression. We generate data according to the stoichiometry matrix (14) and rates (15) using Stochastic Simulation Algorithm (SSA) [34] and sample it at discrete time points. We then generate artificial data that are proportional to the simulated protein data with added normally distributed measurement error with known variance . Furthermore we assume that mRNA levels are unobserved. The volume of the system Ω is unknown and we put Ω = 1 so that concentration equals the number of molecules. Thus the data are of the form

(20)

where is the simulated protein concentration, λ is an unknown proportionality constant and is measurement error. For the purpose of our example we model the transcription function by

(21)

This form of transcription corresponds to an experiment, where transcription increases for t ≤ b₃ as a result of being induced by an environmental stimulus and for t > b₃ decreases towards a baseline level b₄.

We assume that at time t₀ (t₀ <<b₃) the system is in a stationary state. Therefore, the initial condition of the MRE is a function of unknown parameters (ϕ_R(t₀), ϕ_P(t₀)) = (b₄/γ_R, b₄k_P/γ_Rγ_P).

To ensure identifiability of all model parameters we assume that informative prior distributions for both degradation rates are available. Priors for all other parameters were specified to be non-informative. To infer the vector of unknown parameters

we sample from the posterior distribution

using the standard MH algorithm. The distribution P(|Θ) is given by (12).

The protein level of the simulated trajectory is sampled every 15 minutes and a sample size of 101 points obtained. We perform inference for two simulated data sets: estimate 1 is based on a single trajectory while estimate 2 represents a larger data set using 20 sampled trajectories (see Figure 1A). All prior specifications, parameters used for the simulations and inference results are presented in Table 1A. Estimate 1 demonstrates that it is possible to infer all parameters from a single, short length time series with a realistically achievable time resolution. Estimate 2 shows that usage of the LNA does not seem to result in any significant bias. A bias has not been detected despite the very small number of mRNA molecules (5 to 35 - Figure 2A in Additional file 1) and protein molecules (100 to 500 - Figure 1A). The coefficient of variation varied between approximately 0.15 and 0.4 for both molecular species (Figure 1 in the Additional file 1).

Table 1 Inference results for (A) the simple model and (B) autoregulatory model of single gene expression

Full size table

Inference for this model required sampling from the 9 dimensional posterior distribution (number of unknown parameters). If instead one used a diffusion approximation based method it would be necessary to sample from a posterior distribution of much higher dimension (see Additional file 1). In addition, incorporation of the measurement error is straightforward here, whereas for other methods it involves a substantial computational cost [20].

Inference from fluorescent reporter gene data for the model of single gene expression with autoregulation

The following example considers the autoregulatory system with only a small number of reacting molecules. Using SSA we generated artificial data from the single gene expression model with autoregulation. The protein time courses were then sampled every 15 minutes at 101 discrete points per trajectory (see Figure 1B). As before we assume that the mRNA time courses are not observed and that the protein data are of the form given in (20), i.e. proportional to the actual amount of protein with additive Gaussian measurement error. As in the previous case study we estimate parameters from two simulated data sets, a single trajectory and an ensemble of 20 independent trajectories. The inference results summarized in Table 1B show that despite the low number of mRNA (0-15 molecules, see Figure 2 in Additional file 1) and protein (10-250 molecules, see Figure B) all parameters can be estimated well with appropriate precision.

Inference for PCR based reporter data

In the case of reporter assays based on Polymerase Chain Reaction (e.g. RT-PCR, Q-PCR) measurements are obtained from the extraction of the molecular contents from the inside of cells. Since the sample is sacrificed, the sequence of measurements are not strictly associated with a stochastic process describing the same evolving unit. Assume that at each time point t_i(i = 0,..n) we observe l measurements that are proportional to the number of RNA molecules either from a single cell or from a population of s cells. This gives a (n + 1) × l matrix of data points

(22)

where is the actual RNA level, λ is the proportionality constant, is a Gaussian independent measurement error indexed by time t_i. j = 1,..., l indexes the l measurements that are taken at time t_i. Note that and are independent random variables as they refer to different cells. We assume that the dynamics of RNA is described by the simple model of single gene expression with LNA equations (16) and (17). Let ϒ_tdenote the distribution of measured RNA at time t (u_t~ ϒ_t). In order to accommodate for the different form of data we modify the estimation procedure as follows. For analytical convenience we assumed that the initial distribution is normal . This together with eq. (8) and normality of measurement error implies that . Simple explicit formulae for μ_tand are derived in Additional file 1. Since all observations are independent we can write the posterior distribution as

(23)

where ψ(·|, ) is the normal density with parameters , . In order to infer the vector of the unknown parameters Θ = (γ_R, λ, b₀, b₁, b₂, b₃, b₄, , ) we sample from the posterior using a standard MH algorithm. To test the algorithm we have simulated a small (l = 10, n = 50, plotted in Figure 2) and a large (l = 100, n = 50) data set using SSA algorithm with parameter values given in Table 2. The data were sampled discretely every 30 minutes and a standard normal error was added. Initial conditions were sampled from the Poisson distribution with mean b₄/γ_R. The estimation results in Table 2 show that parameters can be inferred well in both cases even though the number of RNA molecules in the generated data is very small (about 5-35 molecules). Since subsequent measurements do not belong to the same stochastic trajectory, estimation for the model presented here is not straightforward with the diffusion approximation based methods.

Table 2 Inference results for PCR based reporter assay simulated data

Full size table

Estimation of gfp protein degradation rate from cycloheximide experiment

In this section the method is applied to experimental data. After a period of transcriptional induction, translation of gfp was blocked by the addition of cycloheximide (CHX). Details of the experiment are presented in Additional file 1. Fluorescence was imaged every 6 minutes for 12.5 h (see Figure 2). Since inhibition may not be fully efficient we assume that translation may be occurring at a (possibly small) positive rate k_P. The model with the LNA is

(24)

The observed fluorescence is assumed to be proportional to the signal with proportionality constant λ. For comparison we also consider the diffusion approximation for which an exact transition density can be derived analytically (see Additional file 1 for derivation)

(25)

Since incorporation of measurement error for the diffusion approximation based model is not straightforward, we assume that measurements were taken without any error to ensure fair comparison between the two approaches. Table 3 shows that estimates obtained with both methods are not very different.

Table 3 Inference results for CHX experimental data

Full size table

Conclusion

The aim of this paper is to suggest the LNA as a useful and novel approach to the inference of biochemical kinetics parameters. Its major advantage is that an explicit formula for the likelihood can be derived even for systems with unobserved variables and data with additional measurement error. In contrast to the more established diffusion approximation based methods [9, 20] the computationally costly methods of data augmentation to approximate transition densities and to integrate out unobserved model variables are not necessary. Furthermore, this method can also accommodate measurement error in a straightforward way.

The suggested procedure here is implemented in a Bayesian framework using MCMC simulation to generate posterior distributions. The LNA has previously been studied in the context of approximating Poisson birth and death processes [22–24, 35] and it was shown that for a large class of models the LNA provides an excellent approximation. Furthermore, in [35] it is shown that for the systems with linear reaction rates the first two moments of the transition densities resulting from the CME and the LNA are equal. Here we propose using the LNA directly for inference and provide evidence that the resulting method can give very good results even if the number of reacting molecules is very small. In our previous study [10] we have presented differences between fitting deterministic and stochastic models, where we used diffusion approximation based method. Our experience from that work and from study [20] is that implementation of diffusion approximation based methods is challenging especially for data that are sparsely sampled in time because the need for imputation of unobserved time points leads to a very high dimensionality of the posterior distribution. This usually results in highly autocorrelated traces affecting the speed of convergence of the Markov chain. Our method considerably reduces the dimension of the posterior distribution to the number of unknown parameters of a model only and is independent of the number of unobserved components (see Additional file 1). Nevertheless it can only be applied to the systems with sufficiently large volume, where fluctuations around a deterministic state are relatively close to the mean.

References

Ehrenberg M, Elf J, Aurell E, Sandberg R, Tegner J: Systems Biology Is Taking Off. Genome Res 2003, 13(11):2377–2380. 10.1101/gr.1763203
Article CAS PubMed Google Scholar
Elowitz MB, Levine AJ, Siggia ED, Swain PS: Stochastic Gene Expression in a Single Cell. Science 2002, 297(5584):1183–1186. 10.1126/science.1070919
Article CAS PubMed Google Scholar
Nelson DE, Ihekwaba AEC, Elliott M, Johnson JR, Gibney CA, et al.: Oscillations in NF-kappaB Signaling Control the Dynamics of Gene Expression. Science 2004, 306(5696):704–708. 10.1126/science.1099962
Article CAS PubMed Google Scholar
Xie SX, Choi PJ, Li GW, Lee NK, Lia G: Single-Molecule Approach to Molecular Biology in Living Bacterial Cells. Annual Review of Biophysics 2008, 37: 417–444. 10.1146/annurev.biophys.37.092607.174640
Article CAS PubMed Google Scholar
Raser JM, O'Shea EK: Noise in Gene Expression: Origins, Consequences, and Control. Science 2005, 309(5743):2010–2013. 10.1126/science.1105891
Article PubMed Central CAS PubMed Google Scholar
Keizer J: Statistical Thermodynamics of Nonequilibrium Processes. Springer, New York; 1987.
Book Google Scholar
Guptasarma P: Does replication-induced transcription regulate synthesis of the myriad low copy number proteins of Escherichia coli? Bioessays 1995, 17(11):987–97. 10.1002/bies.950171112
Article CAS PubMed Google Scholar
Moles CG, Mendes P, Banga JR: Parameter Estimation in Biochemical Pathways: A Comparison of Global Optimization Methods. Genome Res 2003, 13(11):2467–2474. 10.1101/gr.1262503
Article PubMed Central CAS PubMed Google Scholar
Golightly A, Wilkinson DJ: Bayesian Inference for Stochastic Kinetic Models Using a Diffusion Approximation. Biometrics 2005, 61(3):781–788. 10.1111/j.1541-0420.2005.00345.x
Article CAS PubMed Google Scholar
Finkenstadt B, Heron E, Komorowski M, Edwards K, Tang S, Harper C, Davis J, White M, Millar A, Rand D: Reconstruction of transcriptional dynamics from gene reporter data using differential equations. Bioinformatics 2008, 24(24):2901. 10.1093/bioinformatics/btn562
Article PubMed Central CAS PubMed Google Scholar
Gillespie DT: A Rigorous Derivation of the Chemical Master Equation. Physica A 1992, 188(1–3):404–425. 10.1016/0378-4371(92)90283-V
Article CAS Google Scholar
Van Kampen N: Stochastic Processes in Physics and Chemistry. North Holland. 2006.
Google Scholar
Mendes P, Kell D: Non-linear optimization of biochemical pathways: applications to metabolic engineering and parameter estimation. Bioinformatics 1998, 14(10):869–883. 10.1093/bioinformatics/14.10.869
Article CAS PubMed Google Scholar
Ramsay JO, Hooker G, Campbell D, Cao J: Parameter estimation for differential equations: a generalized smoothing approach. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 2007, 69(5):741–796. 10.1111/j.1467-9868.2007.00610.x
Article Google Scholar
Esposito W, Floudas C: Global Optimization for the Parameter Estimation of Differential-Algebraic Systems. Industrial and Engineering Chemistry Research 2000, 39(5):1291–1310. 10.1021/ie990486w
Article CAS Google Scholar
Reinker S, Altman R, Timmer J: Parameter estimation in stochastic biochemical reactions. Systems Biology, IEE Proceedings 2006, 153(4):168–178. 10.1049/ip-syb:20050105
Article CAS Google Scholar
Tian T, Xu S, Gao J, Burrage K: Simulated maximum likelihood method for estimating kinetic rates in gene expression. Bioinformatics 2007, 23: 84. 10.1093/bioinformatics/btl552
Article CAS PubMed Google Scholar
Boys R, Wilkinson D, Kirkwood T: Bayesian inference for a discretely observed stochastic kinetic model. Statistics and Computing 2008, 18(2):125–135. 10.1007/s11222-007-9043-x
Article Google Scholar
Wilkinson D: Stochastic modelling for quantitative description of heterogeneous biological systems. Nature Reviews Genetics 2009, 10(2):122–133. 10.1038/nrg2509
Article CAS PubMed Google Scholar
Heron EA, Finkenstadt B, Rand DA: Bayesian inference for dynamic transcriptional regulation; the Hes1 system as a case study. Bioinformatics 2007, 23(19):2596–2603. 10.1093/bioinformatics/btm367
Article CAS PubMed Google Scholar
Elerian O, Chib S, Shephard N: Likelihood Inference for Discretely Observed Nonlinear Diffusions. Econometrica 2001, 69(4):959–993. 10.1111/1468-0262.00226
Article Google Scholar
Elf J, Ehrenberg M: Fast Evaluation of Fluctuations in Biochemical Networks With the Linear Noise Approximation. Genome Res 2003, 13(11):2475–2484. 10.1101/gr.1196503
Article PubMed Central CAS PubMed Google Scholar
Lars F, Per L, Andreas H: A Hierarchy of Approximations of the Master Equation Scaled by a Size Parameter. Journal of Scientific Computing 2007, 34(2):127–151.
Google Scholar
Kurtz TG: The Relationship between Stochastic and Deterministic Models for Chemical Reactions. The Journal of Chemical Physics 1972, 57(7):2976–2978. 10.1063/1.1678692
Article CAS Google Scholar
Arnold L: Stochastic differential equations: theory and applications. Wiley-Interscience; 1974.
Google Scholar
Oksendal B: Stochastic differential equations: an introduction with applications. 3rd edition. Springer; 1992.
Book Google Scholar
Brockwell P, Davis R: Introduction to time series and forecasting. Springer New York; 2002.
Book Google Scholar
Ronen M, Rosenberg R, Shraiman BI, Alon U: Assigning numbers to the arrows: Parameterizing a gene regulation network by using accurate expression kinetics. Proceedings of the National Academy of Sciences of the United States of America 2002, 99(16):10555–10560. 10.1073/pnas.152046799
Article PubMed Central CAS PubMed Google Scholar
Wu JQ, Pollard TD: Counting Cytokinesis Proteins Globally and Locally in Fission Yeast. Science 2005, 310(5746):310–314. 10.1126/science.1113230
Article CAS PubMed Google Scholar
Gamerman D, Lopes HF: Markov Chain Monte Carlo Stochastic Simulation for Bayesian Inference. 2nd edition. Chapman & Hall/CRC; 2006.
Google Scholar
Thattai M, van Oudenaarden A: Intrinsic noise in gene regulatory networks. Proceedings of the National Academy of Sciences 2001. 151588598 151588598
Google Scholar
Chabot JR, Pedraza JM, Luitel P, van Oudenaarden A: Stochastic gene expression out-of-steady-state in the cyanobacterial circadian clock. Nature 2007, 450: 1249–1252. 10.1038/nature06395
Article CAS PubMed Google Scholar
Komorowski M, Miekisz J, Kierzek A: Translational Repression Contributes Greater Noise to Gene Expression than Transcriptional Repression. Biophysical Journal 2009., 96(2): 10.1016/j.bpj.2008.09.052
Gillespie DT: Exact stochastic simulation of coupled chemical reactions. Journal of Physical Chemistry 1977, 81(25):2340–2361. 10.1021/j100540a008
Article CAS Google Scholar
Ryota T, Hidenori K, J KT, Kazuyuki A: Multivariate analysis of noise in genetic regulatory networks. Journal of Theoretical Biology 2004, 229(4):501–521. 10.1016/j.jtbi.2004.04.034
Article Google Scholar

Download references

Acknowledgements

This research was funded by BBSRC SABR grant BB/F005814/1 and EU BIOSIM Network Contract 005137. DAR is funded by EPSRC Senior Research Fellowship EP/C544587/1 and MK by studentship, Dept of Statistics, University of Warwick. CVH was funded by Wellcome Trust Programme Grant (067252, to JRED and MRHW) and now is recipient of The Prof. John Glover Memorial Postdoctoral Fellowship.

Author information

Authors and Affiliations

Department of Statistics, University of Warwick, Coventry, UK
Michał Komorowski & Bärbel Finkenstädt
Systems Biology Centre, University of Warwick, Coventry, UK
Michał Komorowski & David A Rand
Mathematics Institute, University of Warwick, Coventry, UK
David A Rand
Department of Biology, University of Liverpool, Liverpool, UK
Claire V Harper

Authors

Michał Komorowski
View author publications
You can also search for this author in PubMed Google Scholar
Bärbel Finkenstädt
View author publications
You can also search for this author in PubMed Google Scholar
Claire V Harper
View author publications
You can also search for this author in PubMed Google Scholar
David A Rand
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Michał Komorowski.

Additional information

Authors' contributions

MK proposed and implemented the algorithm. CVH performed the cycloheximide experiment. MK wrote the paper with assistance from BF and DAR, who supervised the study.

Electronic supplementary material

12859_2009_3073_MOESM1_ESM.PDF

Additional file 1: Supplemental information. Supplementary information contains derivation of the theoretical results, details about algorithm implementation and comparison with the inference method based on the diffusion approximation. (PDF 515 KB)

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Komorowski, M., Finkenstädt, B., Harper, C.V. et al. Bayesian inference of biochemical kinetic parameters using the linear noise approximation. BMC Bioinformatics 10, 343 (2009). https://doi.org/10.1186/1471-2105-10-343

Download citation

Received: 29 January 2009
Accepted: 19 October 2009
Published: 19 October 2009
DOI: https://doi.org/10.1186/1471-2105-10-343

Bayesian inference of biochemical kinetic parameters using the linear noise approximation

Abstract

Background

Results

Conclusion

Background

Methods

Transition densities

Inference

Results and Discussion

Model of single gene expression

Inference from fluorescent reporter gene data for the simple model of single gene expression

Inference from fluorescent reporter gene data for the model of single gene expression with autoregulation

Inference for PCR based reporter data

Estimation of gfp protein degradation rate from cycloheximide experiment

Conclusion

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Authors' contributions

Electronic supplementary material

12859_2009_3073_MOESM1_ESM.PDF

Authors’ original submitted files for images

Authors’ original file for figure 1

Authors’ original file for figure 2

Rights and permissions

About this article

Cite this article

Keywords

BMC Bioinformatics

Contact us

Bayesian inference of biochemical kinetic parameters using the linear noise approximation

Abstract

Background

Results

Conclusion

Background

Methods

Transition densities

Inference

Results and Discussion

Model of single gene expression

Inference from fluorescent reporter gene data for the simple model of single gene expression

Inference from fluorescent reporter gene data for the model of single gene expression with autoregulation

Inference for PCR based reporter data

Estimation of gfp protein degradation rate from cycloheximide experiment

Conclusion

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Authors' contributions

Electronic supplementary material

12859_2009_3073_MOESM1_ESM.PDF

Authors’ original submitted files for images

Authors’ original file for figure 1

Authors’ original file for figure 2

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Bioinformatics

Contact us