# Identification of models of heterogeneous cell populations from population snapshot data

- Jan Hasenauer
^{1}Email author, - Steffen Waldherr
^{1}, - Malgorzata Doszczak
^{2}, - Nicole Radde
^{1}, - Peter Scheurich
^{2}and - Frank Allgöwer
^{1}

**12**:125

**DOI: **10.1186/1471-2105-12-125

© Hasenauer et al; licensee BioMed Central Ltd. 2011

**Received: **30 September 2010

**Accepted: **28 April 2011

**Published: **28 April 2011

## Abstract

### Background

Most of the modeling performed in the area of systems biology aims at achieving a quantitative description of the intracellular pathways within a "typical cell". However, in many biologically important situations even clonal cell populations can show a heterogeneous response. These situations require study of cell-to-cell variability and the development of models for heterogeneous cell populations.

### Results

In this paper we consider cell populations in which the dynamics of every single cell is captured by a parameter dependent differential equation. Differences among cells are modeled by differences in parameters which are subject to a probability density. A novel Bayesian approach is presented to infer this probability density from population snapshot data, such as flow cytometric analysis, which do not provide single cell time series data. The presented approach can deal with sparse and noisy measurement data. Furthermore, it is appealing from an application point of view as in contrast to other methods the uncertainty of the resulting parameter distribution can directly be assessed.

### Conclusions

The proposed method is evaluated using artificial experimental data from a model of the tumor necrosis factor signaling network. We demonstrate that the methods are computationally efficient and yield good estimation result even for sparse data sets.

## Background

The main goals of research in systems biology are the development of quantitative models of intracellular pathways and the development of tools to support the modeling process. Thereby, most of the available methods and models consider only a single "typical cell" whereas most experimental data used to calibrate the models are obtained using cell population experiments, e.g. western blotting. This yields problems in particular if the studied population shows a large cell-to-cell variability. In such situations inferring a single cell model from cell population data can lead to biologically meaningless results. In order to understand the dynamical behavior of heterogeneous cell populations, it is crucial to develop cell population models, describing the whole population and not only a single individual [1–4].

This has already been realized by several authors, and it has been shown that stochasticity in biochemical reactions and unequal partitioning of cell material at cell division can lead to complex population dynamics [1–5], such as bimodal distributions. Besides these sources for heterogeneity also genetic and epigenetic differences have to be considered [6].

For the purpose of this paper, heterogeneity in populations is modeled by differences in parameter values and initial conditions of the model describing the single cell dynamics [4, 7, 8]. The network structure is assumed to be identical in all cells. The distribution of the parameter values within the cell population is described by a multi-variate probability density function, which is part of the population model. This modeling framework is well suited for modeling genetic and epigenetic differences among cells [2, 4, 7].

In the following, the problem of estimating the probability density of the parameters is studied. Therefore, we employ population snapshot data (PSD), which provide single cell measurements at every time instance but which do not provide single cell time series data. A typical experimental setup which provides PSD is flow cytometric analysis. In general, PSD are a common data type in the experimental analysis of biological systems.

So far, there are not many methods available for the estimation of parameter distributions. In pharmacokinetic studies mixed effect models [9] are used frequently. Unfortunately, as in the problem we consider the number of individuals is very large (> 10^{4}) and the amount of information per individual very limited (often only one data point), these methods are computationally too demanding. Furthermore, as in this study we are particularly interested in intracellular signal transduction, also methods which purely focus on the population balance [10–12] cannot be employed. In [8, 13, 14] methods are proposed which can in principle deal with the problem at hand. There, the considered estimation problem has been formulated as a convex optimization problem. Unfortunately, these methods either require an extensive amount of measurement data [8, 13], and/or do not allow considering prior knowledge [8, 13, 14]. Additionally, no methods to evaluate the reliability of the estimates are provided.

In this paper a novel Bayesian approach [15, 16] for inferring the parameter density will be introduced. The approach is mainly based on the maximum likelihood methods presented in [13, 14], but can deal with sparse and noisy single cell data in addition to realistic measurement noise models. Furthermore, one may directly access the remaining uncertainty of the estimation result and the prediction uncertainties via the calculation of Bayesian confidence intervals [17, 18]. It is shown that the posterior distribution can be determined efficiently employing a parameterization of the parameter density in combination with commonly used Markov chain Monte Carlo (MCMC) sampling techniques [19].

To illustrate the properties of the proposed methods, a mathematical model of the tumor necrosis factor (TNF) pathway [20] is analyzed using artificial experimental data.

## Methods

### Problem statement

#### Cell population model

with state variables
, output variables
, and parameters
. The vector field
is Lipschitz continuous and the functions
and
are continuous. If for example the concentration
is measured via flow cytometry, we would have
, where *c* is a proportionality factor. The index *i* specifies the individual cells within the population. The parameters *θ*^{(i)}can be kinetic constants, e.g. reaction rates or binding affinities.

*N*cells,

*i*is subject to the probability distribution

Note that interactions among individual cells influencing the analyzed pathway are not allowed. This is a restriction but indeed fulfilled in many *in vitro* lab experiments.

#### Measurement data and noise

In this paper we consider experimental setups where measurement data are obtained in the form of population snapshots, e.g. via flow cytometry. Population snapshots are taken at different times *t*_{
j
}, and the *j* th snapshot contains measurements for the output *y* of *M*_{
j
} cells. Due to experiment setup, it can be assumed that any cell is present at most in one snapshot.

The cells in the individual snapshots are referenced through index sets: snapshot *j* contains all cells from the index set
, with *M*_{0} = 0. Thereby,
in which
is the number of snapshots.

*t*

^{(i)}is the time at which the measurement was taken, and is the measured, noise-distorted output as defined below. If cell

*i*has been measured as part of the

*j*th snapshot, then

*t*

^{(i)}=

*t*

_{ j }. The snapshot

*j*is the set of all data points with

*i*∈

*I*

_{ j }, as depicted in Figure 1:

in which is the total number of measured cells.

We emphasize that experimental setups are considered in which cells are not tracked over time. These setups are very common in studies on the population scale. Classical examples for measurement techniques yielding such data are flow cytometric analysis and cytometric fluorescence microscopy. These measurement techniques allow to determine protein concentrations within single cells. As the population is well mixed when the measurement is performed and no cell is measured more than once, the individual single cell measurements
are independent. This independence of
and
(respectively
and
, *i*_{1} ≠ *i*_{2}, holds if both cells are measured during one snapshot
as well as if the cells are measured within different snapshots
.

*y*

^{(i)}is the actual output from (1). The multiplicative noise is denoted by η

^{×}∈ ℝ

^{ m }and the additive noise s denoted by η

^{+}∈ ℝ

^{ m }. Both,

*η*

^{×}and

*η*

^{+}are in the following assumed to be vectors of log-normally distributed random variables with probability density functions

for all *j* = {×, +}, *k* = 1, ..., *m*. This noise model allows the study of relative and absolute measurement noise and describes the commonly seen noise distributions in biological experiments [21].

#### Problem formulation

As mentioned previously, when studying heterogeneous populations the density of the parameters Θ is in general unknown but necessary to gain an in-depth understanding of the population dynamics. Therefore, we are concerned with the problems:

**Problem 1** *Given the measurement data*
, *the cell population model* Σ_{pop}, *and the noise model* (8)*, infer the parameter density* Θ*.

**Problem 2** *Given the measurement data*
, *the cell population model* Σ_{pop}, *and the noise model* (8)*, determine the uncertainty of the estimated parameter density* Θ*.

Unfortunately, the number of cells considered in a standard lab experiment is on the order of 10^{4} to 10^{7}. Thus, simulating the population model (2) is computationally expensive. Furthermore, it is hard from a theoretical point of view to deal with ensemble models such as (2). Density-based descriptions of the population dynamics are far more appealing for solving Problem 1 and 2. Therefore, in the next section a density description of the population is introduced.

### Density-based modeling of cell populations

in which
is an arbitrary subset of the output space. Hence, *y*^{(i)}(*t*) can be interpreted as a random variable which is distributed according to ϒ(*y*|*t*,Θ).

*y*|

*t*,Θ) for a given Θ,

*S*independent single cell trajectories

*y*

^{(i)}(

*t*) of the cell population (2) are calculated. The parameters for these cells are drawn from Θ and the initial conditions are computed according to

*x*

_{0}(

*θ*

^{(i)}). This yields with distribution of

*y*

^{(i)}(

*t*) depending on Θ. Given an approximation of ϒ is

are used to conserve the property that all variables are positive. The positive definite matrix *H* is used to select the width of the kernel
, and is chosen using the available rule of thumb described in [23]. The selection of *H* is crucial for achieving a good approximation of the probability density, and depends strongly on *S*.

Approaches similar to the one we use to approximate ϒ(*y*|*t*,Θ) are employed in [13, 14], in [8] with a naive density estimator and in [24] in the context of cell migration.

### Estimation of the parameter density

In the previous section an approach to determine the output density ϒ within the cell population for a given parameter density Θ is presented. Based on this an approach for estimating Θ from the available data is developed next.

#### Bayes' theorem for heterogeneous cell populations

*p*(Θ) is the prior probability of Θ, is the conditional probability of observing given Θ, is the posterior probability of Θ given , and is the marginal probability of . As the different single cell measurements are independent (14) can be written as

*i*

_{1}≠

*i*

_{2}, it is not necessary to distinguish between the cases that (1) the two cells are measured at the same instance and that (2) the two cells are measured at different time instances . Hence, merely the conditional probability of each individual single cell measurement has to be determined. For the considered process the can be determined using the output density ϒ,

*θ*,

in which
, with *y*(*t*^{(i)}, *θ*) being the output at time *t*^{(i)}for a cell having parameters *θ*. This reformulation of (16) is possible as ϒ directly depends on Θ. This step simplifies the evaluation of
tremendously.

Based on (15) and (17), the calculation of the posterior probability for a given probability density of the parameters Θ is possible. Unfortunately, the inference problem nevertheless cannot be solved directly, as Θ is element of a function space, and hence further steps are necessary.

#### Parameterization of parameter density

_{ j },

The ansatz functions
are themselves probability densities with
. The weighting vector is denoted by
, where *n*_{
φ
} is the number of ansatz functions and
to guarantee that Θ_{
φ
} is a probability density. The weightings *φ*_{
j
} can be interpreted as parameters determining the probability density Θ_{
φ
} and are for the remainder also called density parameters.

Note that the method presented in the following is independent of the choice of ansatz functions. Nevertheless, a clever choice of the ansatz functions may improve the approximation of the true parameter density dramatically. In this work, the ansatz functions are chosen to be multivariate Gaussians.

in which ϒ (*y*|*t*, Λ_{
j
}) is the output density obtained for single cell parameters distributed according to Λ_{
j
}. This representation of the response is possible as the output density fulfills the superposition principle with respect to the parameter distribution Θ_{
φ
}. This reformulation has the advantage that computing the output density for arbitrary density parameters *φ* only requires the non-recurring computation of the responses ϒ (*y*|*t*, Λ_{
j
}) and summation of those.

#### Reformulation of posterior probability

_{ φ }and ϒ (

*y*|

*t*, Θ

_{ j }) the conditional probability may be parameterized and expressed as the weighted sum,

_{ j }. As the ansatz functions are predefined the conditional probability can be evaluated,

in which *θ*^{(k)}is drawn from Λ_{
j
}, *θ*^{(k)}~ Λ_{
j
}, and *S*_{
c
} is the total size of the Monte Carlo sample {*θ*^{(k)}}_{
k
}. If Λ_{
j
} allows for an efficient drawing of samples, the computational cost of determining
is reasonable, requiring *S*_{
c
} simulations of the single cell model (1).

*φ*, the conditional probability can be simplified to

_{ φ }being a probability density. Note that for parameter estimation often only the shape of the posteriori probability density is of interest, and not the normalization. Therefore, we only consider

in which is the unnormalized posterior probability. Sampling from and will yield the same distribution of sample members. Furthermore, and have the same extrema.

#### Computation of maximum posterior probability estimate

Given the simplified unnormalized posterior probabilities
one important question is which parameter density Θ_{
φ
} maximizes
. This is the most likely parameter density for the measured data and the prior knowledge.

_{ j }is a probability density, is positive if all

*φ*

_{ j }are positive. Employing this and optimizing - instead of , (30) can be simplified to

This minimization problem can for concave *p*(Θ_{
φ
}) be solved rather efficiently, as in such case (31) is a convex optimization problem [25]. For this problem solvers exist which guarantee convergence to the global optimum in polynomial time, e.g the interior point method [25].

#### Uncertainty of parameter density

In the previous section a method is presented which allows the computation of the maximum posterior probability estimate . As measurement data are limited and noise corrupted this estimate will not be the true parameter density. Hence, the uncertainty of the parameter density has to be evaluated.

##### Sampling of posterior probability density

In order to analyze the uncertainty of the estimate, a sample of the posterior probability density is generated. This is possible, as the unnormalized posterior probability of a distribution can be evaluated efficiently given (24) - (28). In this work the sampling is performed with a classical Metropolis-Hastings method [19]. Also Gibbs or slice sampling approaches may be employed. Compared to importance and rejection sampling these methods are well suited as they do not require the selection of an appropriate proposal density, a task which is difficult in this case.

The main point of concern when using MCMC sampling for the problem at hand is that the prior probability and the posterior probability respectively are only non-zero on a (*n*_{
φ
}- 1) -dimensional subset of the density parameter space (28). This is due to the fact that the sum over the elements of *φ* has to be one for Θ_{
φ
} being a probability density. If a standard proposal step was used, the acceptance rate would have been zero.

*n*

_{ φ }- 1)-dimensional space, , and computing the remaining density parameter via the closing condition . According to this the update step for

*φ*consists of two steps:

- 1.
- 2.

In this work, the reduced proposal density is chosen to be a multivariate normal distribution, , with covariance matrix .

*φ*

^{k+1}~

*T*(

*φ*

^{k+1}|

*φ*

^{ k }). The proposed density parameter vector

*φ*

^{k+1}is accepted with probability

The distinction of the two cases is hereby crucial to ensure that only probability densities which are greater than zero for all are accepted.

By combining update and acceptance step one obtains an algorithm which draws a sample of weighting vectors
, or respectively parameter densities
, from the posterior distribution. The number of sample members is thereby *S*_{
φ
}. The pseudo code for the routine is given in Algorithm 1. In particular, the facts that

ensure hereby an efficient sampling. Without this reformulation the integral defining the conditional probability would have to be computed in each update step. The resulting computational effort would be very large.

**Algorithm 1** Sampling of posteriori distribution

**Require:** data
, prior *p*(Θ_{
φ
}), model *p*(*y*|*θ* ), ansatz functions
, initial point *φ*^{0}.

Calculation of conditional probabilities
employing *p*(*y*|*θ* ).

Initialize the Markov Chain with *φ*^{0}.

**for** *k* = 1 to *S*_{
φ
}**do**

Given *φ*^{
i
} propose *φ*^{k+1}from proposal density *T* (*φ*^{k+1}|*φ*^{
k
}).

Calculate posterior probability .

Generate uniformly distributed random number *r* ∈ [0,1].

**if** *r* <*p*_{
a
}(*φ*^{k+1}|*φ*^{
k
}) **then**

Accept proposed parameter vector *φ*^{k+1}.

**else**

Restore previous parameter vector, *φ*^{k+1}= *φ*^{
k
}.

**end if**

**end for**

**end**

##### Bayesian confidence intervals

The sample generated by Algorithm 1 contains information about the shape of the posterior density . This information can be employed to determine the Bayesian confidence intervals, also called credible intervals.

In this work an approach is presented which employs the percentile method [17] to analyze the uncertainty of Θ_{
φ
}. The 100*α*-th percentile of a random variable *r* is the value
below which 100*α* % of the observations fall. Accordingly, the 100(1-*α*)-th percentile interval of *r* is defined as
. The Bayesian confidence interval is frequently defined as the 95-th percentile interval [18]. Thus, the parameter is contained in
with a probability of 95% given the measurement data and the prior knowledge.

*α*-th percentile is not simply a number but a function. As we are interested in the confidence intervals of Θ

_{ φ }(

*θ*), the percentiles are defined point-wise for every

*θ*. The 100

*α*-th percentile of Θ

_{ φ }(

*θ*) is thus the function which gives for every parameter vector

*θ*the value under which 100

*α*% of the observations fall,

As the sample is given, an approximation of and can be obtained by studying the percentiles of the sample [26]. For instance the nearest rank method or linear interpolation between closest ranks can be used to determine .

### Predictions of output density

is determined.

*y*|

*t*,Λ

_{ j }) (12) are computed. Given ϒ(

*y*|

*t*,Λ

_{ j }) and the sample from the posterior density , a sample from the predicted output density is given by

This sample can be used to approximate the prediction confidence interval
. As the population model has to be simulated only *n*_{
φ
} times, this calculation is computationally tractable.

## Results and discussion

To illustrate the properties of the proposed methods, artificial measurement data of a cell population responding to a tumor necrosis factor (TNF) stimulus will be analyzed. For illustration purposes, we consider a case where only one parameter is distributed in a first step. In a second step, we show that the method is also applicable in the case of multi-parametric heterogeneity.

In multicellular organisms, the removal of infected, malfunctioning, or no longer needed cells is an important issue. Therefore, multicellular organisms developed different mechanisms to externally enforce cell death. Thereby the signaling molecule TNF is one of the key players.

TNF can bind to specific death receptors in the cell membrane and is able to induce programmed cell death, also called apoptosis, via the activation of the caspase cascade. On the other hand, it promotes cell survival via the inflammatory response, specifically activation of the NF-*κ* B pathway [27]. The proportion of the activation of these two signaling pathways decides about the fate of the single cell. In the following a simple model for the caspase and NF-*κ* B activation is studied which reproduces the main features of the single cell response to a TNF stimulus.

### Model of TNF signaling pathway

*κ*B (NF-

*κ*B) and its inhibitor I-

*κ*B are considered in the model. The model is given by the ODE system

*x*

_{ i },

*i*= 1, ..., 4 denote the relative activities of the signaling proteins C8a, C3a, NF-

*κ*B and I-

*κ*B, in this order. The functions act

_{ j }(

*x*

_{ i }) and inh

_{ j }(

*x*

_{ i }) represent activating and inhibiting interactions, respectively. They are given by

*a*

_{ j }and

*b*

_{ j }are activation and inhibition thresholds, respectively, and take values between 0 and 1. The external TNF stimulus is denoted by

*u*. Initial conditions of the single cells are the steady states with C3a = 0 for

*u*= 0. All nominal parameter values are given in Table 1.

Nominal parameter values for the TNF signaling model (41).

i | 1 | 2 | 3 | 4 | 5 |
---|---|---|---|---|---|

| 0.6 | 0.2 | 0.2 | 0.5 | |

| 0.4 | 0.7 | 0.3 | 0.5 | 0.4 |

It is known from experiments that the cellular response to a TNF stimulus is highly heterogeneous within a clonal cell population. Some cells die, others survive. The reasons for this heterogeneous behavior are unclear, but of great interest for biological research in TNF signaling, e.g. the use of TNF or related molecules as anti-cancer agent.

To understand the biological process at the physiological and biochemical level it is crucial to consider this cellular heterogeneity, using for example cell population modeling. Here, we model heterogeneity at the cell level via differences in the parameters *b*_{3} and *a*_{4}. The parameter *b*_{3} describes the inhibitory effect of NF-*κ* B via the C3a inhibitor XIAP onto the C3 activity, and the parameter *a*_{4} models the activation of I-*κ* B via NF-*κ* B. Studies showed that these two interactions show large cell-to-cell variability [4, 7, 28].

### Univariate parameter density

*y*=

*x*

_{2}, with measurement noise according to (7), where

*μ*

^{×}= 0,

*σ*

^{×}= 0.1,

*μ*

^{+}= log(0.05), and

*σ*

^{+}= 0.3. This corresponds to an average noise level of about 15%. The generated artificial experimental data for a bimodal distribution in

*b*

_{3}are depicted in Figure 6.

*s*

_{ j }such that . The center points

*μ*

_{ j }are equidistantly distributed on the interval [0,1]. The prior probability

*p*(Θ

_{ φ }) is chosen to be

*p*

_{ β }is the probability density of the beta-distribution. The parameter

*α*

_{ j }and

*β*

_{ j }are selected such that

*p*

_{ β }(

*φ*

_{ j }

*|α*

_{ j }

*|β*

_{ j }) has its extremum at and convariance

*σ*

^{2}. The distribution of a sample {

*φ*

^{ k }}

_{ k }drawn from this prior is shown in Figures 7 and 8. Note that the prior does not enforce a trend to smaller or larger parameter values of

*b*

_{3}. Furthermore, it does not enforce a trend to unimodal or bimodal distributions Θ

_{ φ }(

*b*

_{3}). Such distribution properties shall be inferred from the data.

Given the ansatz functions Λ_{
j
} (45) the conditional probabilities
of observing
are determined using importance sampling, according to (25). This computation takes about three minutes, on a standard personal computer using a single CPU. Thereby, 32% of the computation time are required for the simulation of the individual cells *y*(*t*, *θ*^{(j)}) for individual parameter values *θ*^{(j)}, and 59% for the evaluation of the conditional probability
. The rest is spent on pre- and post-processing. Subsequently, MCMC sampling is performed to obtain a sample
of the prior (with *σ*^{2} = 0.05), of the conditional, and of the posterior probability distribution. The sample has *S*_{
φ
} = 10^{6} members and the generation takes only four minutes. The computation is very fast, as the proposed approach simplified the evaluation of the conditional probability to a matrix vector multiplication. Note, that all steps during the computation of the conditional probabilities and the MCMC sampling can be parallelized, yielding a tremendous speed-up for more complex models.

The results of the sampling are illustrated in Figure 7 using parallel coordinates [29]. From this representation of
it can be seen that after the learning processes most of the density parameters still show large uncertainties. The uncertainty in the posterior distribution is a lot smaller than the uncertainty in the likelihood function, due to the stabilization via the prior. Note that the visualization also uncovers pronounced correlations between some parameters, e.g. *φ*_{10} and *φ*_{11} are negatively correlated for
. This indicates that the model of the density of *b*_{3} is over-parameterized with respect to the data. Thus, the number of ansatz functions could be reduced while still achieving a good fit.

To analyze the uncertainty of Θ_{
φ
} in more detail the sample
is employed to determine the 80%, 90%, 95%, and 99% Bayesian confidence intervals. The results are depicted in Figure 8. It can be seen that the confidence intervals for some values of *b*_{3} are rather small, indicating that the data contain many information about these regions. Unfortunately, in particular for *b*_{3} > 0.6 the confidence intervals are very wide showing that the parameter density in this area cannot be inferred precisely. But, although the amount of data is limited and the uncertainty with single *φ*_{
i
}'s may be large, the posterior distribution of Θ_{
φ
} already shows key properties of the true parameter density, e.g. the bimodal shape, which has not been provided as prior information. This bimodal shape is also seen in the likelihood function, but there the uncertainties are larger than in the posterior probability distribution.

_{ φ }also the predictive power of the model can be evaluated. This is exemplified by studying the confidence interval of and for the previously considered experimental setup. The bar indicates that the distribution of the noise corrupted output instead of the true output

*y*is considered. This allows the direct comparison of the prediction with the data. The predictions are shown in Figure 9.

It is obvious that, although the parameter distributions show large uncertainties, the predictions are rather certain. This is indicated by tight confidence intervals. Furthermore, the mean predictions
and the predictions with highest posterior probability
agree well with the true output distribution
, for measured output C3a and predicted output NF-*κ* B. The small prediction uncertainties can be explained to be sloppiness [30] of the density parameters *φ*_{
i
} parametrizing the distribution of *b*_{3}. A detailed analysis indicates (not shown here) that the number of ansatz function can be decreased, still ensuring a good approximation of the distribution of *b*_{3}.

### Multivariate parameter density

In many biological systems several cellular parameters are heterogeneous and different cellular concentrations can be measured [7]. Therefore, we show in this section that the proposed method can also be employed to estimated multivariate parameter densities from multi-dimensional outputs. Furthermore, the influence of the choice of the prior on the estimation result is analyzed.

*κ*B,

*y*= [

*x*

_{2},

*x*

_{3}]

^{ T }. The considered artificial experimental data of 10

^{4}cells are depicted in Figure 10. The ansatz function for Θ

_{ φ }are

*n*

_{ φ }= 100 truncated multivariate Gaussians equivalently to (45). The covariance matrix is 0.06

^{2}· I

_{2}and the extrema are equidistantly distributed on a regular 2-dimensional grid which is aligned with the axes.

*φ*

_{ext}are chosen as in the last section such that the prior is flat. The standard deviation on the other hand is reduced step-wise from

*σ*= 0.285 (completely uninformative as almost uniform on the feasible interval to

*σ*= 0.001 (very informative). Given this requirements, the values

*α*

_{ i }and

*β*

_{ i }of the prior (46) are determined. The result for different numbers of measured cells sampled from the available data set is shown in Figure 11. Note that the IMSE is a stochastic quantity as the selection of measured cells is a stochastic processes and hence also the estimated density is stochastic. To account for this stochasticity, several realizations are performed and the mean is computed.

*σ*, one observes a fast convergence of the to Θ

^{true}, as shown in Figure 12, and only little variation for small amounts of data. Hence, the usage of prior knowledge, even if it is only partially correct, yields for more stable estimates and faster convergence. Furthermore, this study suggests that the typical number of cells measured by flow cytometry (10

^{4}) is informative enough to infer key features of population heterogeneity.

## Conclusions

In this paper a Bayesian approach for inferring the parameter density for heterogenous cell populations with single cell resolution from population data is presented. We show that the proposed model can deal with limited and noisy measurement data as well as realistic noise models. The method utilizes a parameterization of the parameter density which, in combination with a reformulation of the conditional probability, allows a computationally efficient evaluation of the posterior probability. Compared to other methods for cell populations this approach does not rely on the approximation of the measured population response using density estimators.

For sampling from the posterior probability the Metropolis-Hastings algorithm is used. Here it has been adapted to be applicable to the considered constraint problem. Using this sampling strategy a sample from the posterior probability density is determined. This sample is employed to compute Bayesian confidence intervals for the parameter distribution, as well as for the model predictions. Also summary statistics like mean parameter density and mean predicted output density can easily be determined. For concave prior distributions we could even show that the calculation of the parameter density for the highest posterior probability is a convex problem.

The properties of the proposed scheme are evaluated using artificial data of a TNF signal transduction model. It could be shown that the proposed method yields good estimation results for a realistic experimental setup. Furthermore, although the remaining uncertainties are large, the predictions showg only small uncertainty indicating sloppiness of parameters.

Concerning the choice of the prior distribution it could be shown that the regularizing effect is beneficial if only little data is available. On the other hand, if the amount of available data increases, informative but not carefully chosen priors slow down the convergence.

## Declarations

### Acknowledgements

The authors acknowledge financial support from the German Federal Ministry of Education and Research (BMBF) within the FORSYS-Partner program (grant nr. 0315-280A and D), from the German Research Foundation within the Cluster of Excellence in Simulation Technology (EXC 310/1) at the University of Stuttgart, and from Center Systems Biology (CSB) at the University of Stuttgart.

## Authors’ Affiliations

## References

- Henson MA: Dynamic modeling of microbial cell populations. Curr Opin Biotechnol 2003, 14(5):460–467. 10.1016/S0958-1669(03)00104-6View ArticlePubMedGoogle Scholar
- Mantzaris NV: From single-cell genetic architecture to cell population dynamics: Quantitatively decomposing the effects of different population heterogeneity sources for a genetic network with positive feedback architecture. Biophys J 2007, 92(12):4271–4288. 10.1529/biophysj.106.100271PubMed CentralView ArticlePubMedGoogle Scholar
- Munsky B, Trinh B, Khammash M: Listening to the noise: random fluctuations reveal gene network parameters. Mol Syst Biol 2009, 5(318):1–7.Google Scholar
- Spencer SL, Gaudet S, Albeck JG, Burke JM, Sorger PK: Non-genetic origins of cell-to-cell variability in TRAIL-induced apoptosis. Nat 2009, 459(7245):428–433. 10.1038/nature08012View ArticleGoogle Scholar
- Stamatakis M, Zygourakis K: A mathematical and computational approach for integrating the major sources of cell population heterogeneity. J Theor Biol 2010, 266(1):41–61. 10.1016/j.jtbi.2010.06.002PubMed CentralView ArticlePubMedGoogle Scholar
- Avery SV: Microbial cell individuality and the underlying sources of heterogeneity. Nat Rev Microbiol 2006, 4: 577–587. 10.1038/nrmicro1460View ArticlePubMedGoogle Scholar
- Albeck JG, Burke JM, Aldridge BB, Zhang M, Lauffenburger DA, Sorger PK: Quantitative analysis of pathways controlling extrinsic apoptosis in single cells. Mol Cell 2008, 30(1):11–25. 10.1016/j.molcel.2008.02.012PubMed CentralView ArticlePubMedGoogle Scholar
- Waldherr S, Hasenauer J, Allgöwer F: Estimation of biochemical network parameter distributions in cell populations. Proc. of the 15th IFAC Symp. on Syst. Ident 2009, 15: 1265–1270.Google Scholar
- Al-Banna MK, Kelman AW, Whiting B: Experimental design and efficient parameter estimation in population pharmacokinetics. J Pharmacokin Biopharm 1990, 18(4):347–360. 10.1007/BF01062273View ArticleGoogle Scholar
- Banks HT, Suttona KL, Thompson WC, Bocharov G, Roose D, Schenkel T, Meyerhans A: Estimation of cell proliferation dynamics using CFSE data. Bull Math Biol 2010, 73(1):116–150.PubMed CentralView ArticlePubMedGoogle Scholar
- Luzyanina T, Roose D, Bocharov G: Distributed parameter identification for label-structured cell population dynamics model using CFSE histogram time-series data. J Math Biol 2009, 59(5):581–603. 10.1007/s00285-008-0244-5View ArticlePubMedGoogle Scholar
- Luzyanina T, Roose D, Schenkel T, Sester M, Ehl S, Meyerhans A, Bocharov G: Numerical modelling of label-structured cell population growth using CFSE distribution data. Theor Biol Med Model 2007, 4(26):1–14.Google Scholar
- Hasenauer J, Waldherr S, Doszczak M, Scheurich P, Allgöwer F: Density-based modeling and identification of biochemical networks in cell populations. In Proc. of 9th Int. Symp. on Dynamics and Control of Process Syst. (DYCOPS 2010), Leuven, Belgium, July 5–7 Edited by: Kothare M, Tade M, Wouwer AV, Smets I. 2010, 306–311.Google Scholar
- Hasenauer J, Waldherr S, Radde N, Doszczak M, Scheurich P, Allgöwer F: A maximum likelihood estimator for parameter distributions in heterogeneous cell populations. Procedia Computer Science 2010, 1(1):1649–1657.View ArticleGoogle Scholar
- Klinke DJ: An empirical Bayesian approach for model-based inference of cellular signaling networks. BMC Bioinf 2009, 10(371):1–18.Google Scholar
- Wilkinson DJ: Bayesian methods in bioinformatics and computational systems biology. Briefings in Bioinf 2007, 8(2):109–116.View ArticleGoogle Scholar
- Joshi M, Seidel-Morgenstern A, Kremling A: Exploiting the bootstrap method for quantifying parameter confidence intervals in dynamical systems. Metabolic Eng 2006, 8: 447–455. 10.1016/j.ymben.2006.04.003View ArticleGoogle Scholar
- Raue A, Kreutz C, Maiwald T, Bachmann J, Schilling M, Klingmüller U, Timmer J: Structural and practical identifiability analysis of partially observed dynamical models by exploiting the profile likelihood. Bioinf 2009, 25(25):1923–1929.View ArticleGoogle Scholar
- MacKay DJC: Information Theory, Inference, and Learning Algorithms. Cambridge University Press; 2005.Google Scholar
- Chaves M, Eissing T, Allgöwer F: Bistable biological systems: A characterization through local compact input-to-state stability. IEEE Trans Autom Control 2008, 53: 87–100.View ArticleGoogle Scholar
- Kreutz C, Bartolome Rodriguez MM, Maiwald T, Seidl M, Blum HE, Mohr L, Timmer J: An error model for protein quantification. Bioinf 2007, 23(20):2747–2753. 10.1093/bioinformatics/btm397View ArticleGoogle Scholar
- Gander W, Gautschi W: Adaptive quadrature-revisited. Bit Numerical Mathematics 2000, 40(18):84–101.View ArticleGoogle Scholar
- Silverman BW: Density Estimation for Statistics and Data Analysis. Monographs on Statistics and Applied Probability. London: Chapman and Hall; 1986.View ArticleGoogle Scholar
- Surulescu C, Surulescu N: A nonparametric approach to cells dispersal. Int J Biomath Biostat 2010, 1: 109–128.Google Scholar
- Boyd S, Vandenberghe L: Convex Optimisation. Cambridge University Press, UK; 2004.View ArticleGoogle Scholar
- DiCiccio TJ, Efron B: Bootstrap confidence intervals. Statist Sci 1996, 11(3):189–228.View ArticleGoogle Scholar
- Wajant H, Pfizenmaier K, Scheurich P: Tumor necrosis factor signaling. Cell Death Differ 2003, 10: 45–65. 10.1038/sj.cdd.4401189View ArticlePubMedGoogle Scholar
- Paszek P, Ryan S, Ashall L, Sillitoe K, Harper CV, Spiller DG, Rand DA, White MRH: Population robustness arising from cellular heterogeneity. PNAS 2010, 107(25):1–6.View ArticleGoogle Scholar
- Inselberg A, Dimsdale B: Parallel coordinates: a tool for visualizing multi-dimensional geometry. Proc of IEEE Visualization 1990, 361–378.Google Scholar
- Gutenkunst RN, Waterfall JJ, Casey FP, Brown KS, Myers CR, Sethna JP: Universally sloppy parameter sensitivities in systems biology models. PLoS Comput Biol 2007, 3(10):1871–1878.View ArticlePubMedGoogle Scholar

## Copyright

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.