- Research
- Open access
- Published:

# Stochastic simulations of minimal cells: the Ribocell model

*BMC Bioinformatics*
**volume 13**, Article number: S10 (2012)

## Abstract

### Background

Over the last two decades, lipid compartments (liposomes, lipid-coated droplets) have been extensively used as in vitro "minimal" cell models. In particular, simple and complex biomolecular reactions have been carried out inside these self-assembled micro- and nano-sized compartments, leading to the synthesis of RNA and functional proteins inside liposomes. Despite this experimental progress, a detailed physical understanding of the underlying dynamics is missing. In particular, the combination of solute compartmentalization, reactivity and stochastic effects has not yet been clarified. A combination of experimental and computational approaches can reveal interesting mechanisms governing the behavior of micro compartmentalized systems, in particular by highlighting the intrinsic stochastic diversity within a population of "synthetic cells".

### Methods

In this context, we have developed a computational platform called ENVIRONMENT suitable for studying the stochastic time evolution of reacting lipid compartments. This software - which implements a Gillespie Algorithm - is an improvement over a previous program that simulated the stochastic time evolution of homogeneous, fixed-volume, chemically reacting systems, extending it to more general conditions in which a collection of similar such systems interact and change over the course of time. In particular, our approach is focused on elucidating the role of randomness in the time behavior of chemically reacting lipid compartments, such as micelles, vesicles or micro emulsions, in regimes where random fluctuations due to the stochastic nature of reacting events can lead an open system towards unexpected time evolutions.

### Results

This paper analyses the so-called Ribocell (RNA-based cell) model. It consists in a hypothetical minimal cell based on a self-replicating minimum RNA genome coupled with a self-reproducing lipid vesicle compartment. This model assumes the existence of two ribozymes, one able to catalyze the conversion of molecular precursors into lipids and the second able to replicate RNA strands. The aim of this contribution is to explore the feasibility of this hypothetical minimal cell. By deterministic kinetic analysis, the best external conditions to observe synchronization between genome self-replication and vesicle membrane reproduction are determined, while its robustness to random fluctuations is investigated using stochastic simulations, and then discussed.

## Background

In recent years, many researchers have been actively working in the field of the *de novo* synthesis of the artificial cell [1–5], i.e. a cell made from scratch using both synthetic and natural compounds. This scientific challenge has many relevant aspects: first of all, it can reinforce the theory of abiogenesis in the origins of life debate [6], proving that life can emerge spontaneously in a test tube, at least in suitable experimental conditions. Furthermore, the possibility to create a population of artificial cells programmed for the synthesis of chemical compounds of pharmacological and industrial interest is a significant biotechnological goal [7, 8]. Artificial cells can also be envisaged, in the maybe not too distant future, as microscopic diagnostic and pharmacological labs to be delivered into the human body in order to synthetize and release drugs as a response to an external stimulus in presence of a disease [9].

The topic of artificial cells is strongly related to the minimal cell notion that defines the simplest cell to be considered alive and then to be experimentally implemented. A minimal living cell, or protocell, can be defined as the minimum supramolecular bounded structure based on the lowest number of molecular species and metabolic processes that is capable of self-maintaining, self-reproducing and evolving [10]. *Self-Maintenance* is the necessary condition that must be fulfilled by the protocell to be considered alive, i.e. it must stay in a steady state where all its constituents are continuously synthesized and refurbished [11]. Nevertheless, this is not sufficient to implement cellular life as we actually know it; in fact, real cells are also able to grow and to self-reproduce. On the other hand, *Self-Reproduction* can be seen as a consequence of the cellular metabolism that can keep the minimal cell in a stationary state or bring it to a continuous growing regime that forces the organism to divide in order to maintain its internal coherence, i.e. its stability. Both these features are individual properties of a single cell that can be observed during its life time. In contrast, *Evolvability* is a collective property that can be exhibited only by a population of cells and on a time scale of several generations, according to a Darwinian selection mechanism [6, 12].

Some years ago, Szostak and colleagues proposed a minimal cell prototype called the *Ribocell*: the RNA-based cell, that in principle can exhibit all the three properties to be considered a living protocell [13]. This theoretical cellular model consists in a self-replicating minimum genome coupled with the self-reproduction of the lipid vesicular container [14]. These authors envisaged the existence of two hypothetical ribozymes [13], one (R_{L}) able to catalyze the conversion of molecular precursors (P) into membrane lipids (L) and the other one (R_{P}) able to duplicate RNA strands. Therefore, in an environment rich in both lipid precursors (P) and activated nucleotides (NTPs), the Ribocell can self-reproduce if both processes, i.e. genome self-replication and membrane reproduction (growth and division), are somehow synchronized.

In previous papers [15, 16], we have presented an *in silico* implementation of the Ribocell based on the internal metabolism reported in Figure 1 and on the recently introduced self-replicating lipid vesicle model [17]. By means of a deterministic analysis, we showed that if the kinetic constant for lipid formation *k*_{
L
} is in the range: 1.7·10^{3}s^{-1}M^{-1}≤*k*_{L}≤1.7·10^{5}s^{-1}M^{-1} then synchronization between vesicle reproduction and genome replication can spontaneously emerge under the model assumptions and kinetic parameters reported in Table 1. Deterministic calculations were performed for two ribozymes 20 bases long and showed that the Ribocell reaches a stationary growth and division regime, where the cell size remains constant after each division along with the amount of genetic materials. Although the observed cell life time stabilizes after the first 10 generations, it remains very high, at over 80 days for all the *k*_{
L
}values in the synchronization range, making the Ribocell very hard to implement and study experimentally.

In this paper, we first apply this model to 100-base-long ribozymes, in an attempt to find the best experimental conditions to reduce so as Ribocell life time. By using the deterministic approach, the robustness of the stationary growth and division regime will be investigated in terms of external substrate concentrations, vesicle size and initial ribozyme amount in order to define optimal external conditions for Ribocell self-reproduction.

Therefore, the influence of ribozyme length will also be explored in the optimal external conditions by ranging strand size from 20 to 200 bases in length and keeping all the other kinetic parameters constant. 20 bases is in fact the minimum length required to observe a folded RNA structures, i.e. a structure that can reasonably exhibit catalytic action. On the other hand, entities of about 200 nucleotides have been suggested as plausible ancient proto-ribosomes [18] even though, more recently, smaller subunits of 60 nucleotides have also been considered as plausible candidates [19]. Moreover, the dependence of Ribocell time behavior on the kinetic constants of RNA dimer formation and dissociation will also be studied.

Finally, stochastic simulations will be performed in order to test the robustness of the ribocell base on 100-base length ribozymes in optimal external conditions, with the aim of elucidating the role of intrinsic and extrinsic stochasticity on the time behavior of a protocell population.

## Methods

Self-reproducing vesicles are compartmentalized chemically-reacting systems where self-assembly processes are coupled with chemical reactions that produce amphiphilic molecules. To study the time behavior of these systems, we use both a deterministic and a stochastic approach in order to get insights into the average behavior of the protocell population and, at the same time, to elucidate the role of random fluctuations. Given a certain minimal cell model, i.e. a reaction mechanism with all the required parameters (kinetic constants, permeability coefficients, initial concentrations), the deterministic analysis can be done by numerically solving the ordinary differential equation set (ODES) or by analytically integrating an approximated reduced set of differential equations. Examples of the latter approach can be found in our previous works where self-reproducing micelles [20] and vesicles [21] were studied. In order to take into account the stochastic effects, some years ago we developed a Monte Carlo program [22] based on the Gillespie's direct method [23] designed for coupling amphiphile self-assembly [24] with chemical reactions and hydrophobic solute absorption [25] in a homogeneous well stirred macroscopic reactor. More recently, this program has been improved and a computational platform called ENVIRONMENT [26] was later developed to cope with the more general case of a collection of reacting lipid compartments that interact and change over time. In the next sub-sections, the general model for a self-reproducing vesicle will be recalled and discussed [17].

*In silico* chemically reacting vesicles

A chemically reacting vesicle can be described as a homogeneous reacting aqueous domain enclosed by a lipid bilayer. Molecules can be exchanged with the external environment thanks to transport processes through the lipid membrane. A flux of water can also take place through the membrane in order to balance the osmotic pressure, i.e. the difference between the internal and the external overall concentrations. Chemical reactions can occur in the vesicle water core, according to the assumed internal metabolism, and amphiphilic molecules can be absorbed from and released towards both the external and internal aqueous solutions. Hence, the vesicle time state is defined by the following array:

where *n*_{i}^{C} are the molecular numbers of species X_{i} (*i* = 1,2 ... *N*) present in the vesicle aqueous core and *n*_{L}^{μ} is the number of amphiphiles X_{
L
} (1≤ *L* ≤*N*) in the membrane. *V*_{C} is the water internal volume. In the stochastic approach, all *n*_{i}^{C} and *n*_{L}^{μ} are discrete integer numbers and there exist as many arrays as vesicles in the systems, while in the deterministic analysis there is a single array with real values that represents the average time state of the entire reacting vesicle population.

Table 2 compares the concentration change rates with the event propensity probability density functions for all the mentioned processes. The deterministic reaction rates are given according to the mass action law and they make it possible to write down the deterministic set of ordinary differential kinetic equations. On the other hand, the propensity probability density functions are used to carry out Monte Carlo simulations according to the Gillespie's direct method by means of the ENVIRONMENT software [26]. Moreover, in the stochastic simulations, the water flux will be considered instantaneous due to the high permeability of lipid membranes to water. At each step throughout the MC run, the aqueous vesicle core volume is rescaled in order to keep the vesicle in an osmotic balanced state by using the formula reported in Table 2. Conversely, deterministic calculation will explicitly take into account the water flux dynamics through the lipid membrane. However, a discrepancy between the two approaches can emerge only in presence of an abrupt change of the osmolite external concentration, due to fast dilution or solute addition of the external solution [26]. Another main difference between the two approaches is how the encapsulated molecules are distributed between the daughters after protocell division. In the deterministic approach, all the molecular content of the mother vesicle is halved while it is randomly distributed between the twin daughters in the stochastic simulations.

### Self-reproducing vesicles

The vesicle surface is estimated by the formula: S_{
μ
} = (*α*_{
L
}n_{
L
}^{μ})/2 where *α*_{
L
}is the amphiphile X_{
L
} head area and 1/2 takes into account the double layered structure of the membrane. In presence of the synthesis of fresh amphiphiles, the membrane surface and the aqueous core volume can follow two different dynamics and this may bring the vesicles towards unstable conditions. The stability of the vesicle membrane can then be monitored by means of the reduced surface *ϕ*:

that is, the ratio between the actual membrane surface *S*_{μ} and the spherical area that would perfectly wrap the actual core volume V_{
C
}. Assuming that, for a given size, the spherical shape (*ϕ* = 1) represents the minimum energy state, swollen (*ϕ* < 1) and deflated (*ϕ* > 1) vesicles are in high energetic conditions due to the elastic and the bending tension, respectively. Therefore, vesicles are assumed to be stable only in a small range of *ϕ* values around 1:

*ε* and *η* being the osmotic and dividing tolerance coefficients, respectively. In fact, vesicles in hypotonic solutions can swell, stretching the membrane until they reach a critical state: *ϕ* = 1-*ε*. The osmotic tolerance *ε* can be experimentally determined by measuring the maximum difference in osmolite aqueous concentrations -between the internal core (*C*_{
T
}^{C}) and the external environment (*C*_{
T
}^{E})- that vesicles can bear. For oleic acid, *ε* was found equal to 0.21 [27]. In our model, at the critical bursting point, vesicles are assumed simply to break down, releasing all their internal content into the external environment and remaining in solution as flat bilayers, since bilayer sealing processes are not considered in this model. On the other hand, deflated vesicles are supposed to be able to divide in order to minimize the bending energy. The dividing condition is reached when they can form two equal volume spherical daughters (*ϕ* = 2^{1/3}). So *η* introduces a tolerance that is linked to the relative flexibility exhibited by any membrane. However, as a simplifying assumption in this work, *η* = 0 will be supposed. As already mentioned, after each division, all the molecular content of the mother vesicle is halved in the deterministic approach, while it is randomly distributed between the twin daughters in the stochastic simulations.

### The Ribocell model

Figure 1 reports the internal metabolism proposed for the Ribocell. Both pairs of RNA strands reversibly associate (1) and these processes are shifted towards the dimer formation and are strongly dependent on temperature. The replication of any RNA strand is catalyzed by the polymerase R_{P} according to the steps in bracket (2). This process is described as a catalytic template-directed addition of mononucleotides with high fidelity and processivity. It starts with R_{P} binding any of the monomeric strands S (S = R_{P}, _{c}R_{P}, R_{L} and _{c}R_{L}) to form the complex R@S. This complex will then initiate the polymerization of the conjugate strand _{c}S, by coupling and iteratively binding the complementary bases and releasing the by-product W. When the strand _{c}S has been completely formed, the polymerase ribozyme releases the new dimer. Finally, the ribozyme R_{L} catalyzes the conversion of the precursor P into the lipid L (3).

Table 1 shows the values of all kinetic constants and membrane permeabilities used in this work, with their respective references. In this work, the length of the two ribozymes is assumed to be 100 nucleotides long (20 being the minimum base number for observing a folded RNA conformation). Both R_{L} and R_{P} are created with a random sequence of bases, and they are assumed to have similar kinetic behaviors for the sake of simplicity. The kinetic constants of formation *k*_{SS} and dissociation *k*_{S} of both dimers were set equal to the values experimentally measured for a sequence of 10 nucleotides [28]. The kinetic constants for both complex formation R@S and complex dissociation R@S_{c}S (S = R_{P}, _{c}R_{P}, R_{L} and _{c}R_{L}) were set equal to those measured for the human enzyme β-polymerase [29] that binds a DNA single strand and dissociates from a DNA double helix, respectively. The rate constant for the catalytic synthesis of lipid *k*_{
L
} was taken to be 10^{5} times larger than that of the splicing reaction, catalyzed by the hammerhead ribozyme [30]. The kinetic behavior of different nucleotides is assumed to be the same and a single value is assigned to *k*_{NTP} derived from experimental data simulations (De Frenza private communication) of the DNA template directed synthesis in fatty acid mixed vesicles [31]. In the same way, the common value of the membrane permeability to activated nucleotides was also estimated, while the membrane is assumed impermeable to genetic material. The kinetic constants of the membrane/aqueous solution lipid exchange for oleic acid vesicles are taken from a previous work [26] where they were obtained by simulating the competition between isotonic and osmotically swollen oleic vesicles [27]. The amphiphile head area *α*_{
L
} = 0.3 nm^{2} and the osmotic tolerance ε = 0.21 are defined according to data reported in literature for oleic acid vesicles [27]. The only two parameters assigned arbitrarily are therefore the membrane permeability to the byproduct: *P*_{W} = 0.0 cm/s, based on the assumption that W is a charged species, and the permeability to the precursor: *P*_{P} = 0.42·10^{-8} cm/s, corresponding to the oleic acid membrane's permeability to Arabitol [32] and comparable to those of similar organic compounds.

A common simplifying assumption to both approaches is to consider the external concentrations of nucleotides (NTPs), lipid precursor (P) and inert compound (B) to be constant throughout the time-courseof the process, thanks to an incoming flux of material in the reactor vessel: *continuous stirred tank reactor approximation*. The byproduct (W) concentrations is also assumed to be constantly equal to zero outside. For all these compounds, except B, the internal aqueous concentration is zero at the beginning. Moreover, all the calculations are performed starting from an initial isotonic condition, so that the internal concentration of B is properly adjusted in order to counterbalance the presence of the lipid precursor and nucleotides outside and the genetic material inside, respectively.

As has been pointed out before, given the set of kinetic parameters reported in Table 1 we are looking for a set of initial concentrations that can allow the Ribocell to reach a stationary regime of growth and division, i.e. a dynamic state where the protocell grows and divides producing two twin daughters the same size as their mother at the beginning of its life cycle.

In order to achieve this in our model, a spontaneous synchronization between membrane and the aqueous volume core of the self-replicating vesicle must take place. By introducing *the control growth coefficient* γ as the ratio between the relative change rates of volume *v*_{V} and surface *v*_{S}:

it is easy to show(unpublished observations) for growing protocells that synchronization can take place only if *γ* = 1, while if *γ* > 1 the volume will increase much faster than the membrane surface and the vesicle can become energetically unstable, leading to an osmotic burst. On the other hand, if *γ* < 1 then the vesicle will divide decreasing in size generation by generation. It is beyond the scope of this work to go into the mathematical details of this formula, so the interested reader should refer to an incoming paperfor a more detailed discussion.

Nevertheless, it is worthwhile to keep in mind that *v*_{S} is essentially proportional to the rate of lipid synthesis, since amphiphile uptake by the membrane is very fast when the concentration of lipids is above the equilibrium value. Instead, having assumed the external value *C*_{T}^{E} to be constant and a vesicle being in a osmotic balanced condition: *C*_{T}^{E}≈ *C*_{T}^{C} = *V*_{C}*N*_{T}^{C},*v*_{V} is driven by the overall internal *N*_{T}^{C} population rise. Therefore, since *N*_{T}^{C} increases essentially owing to the waste production that takes place with ribozyme self-replication and lipid synthesis, this has the effect of coupling the membrane reproduction with the genome replication and with the volume growth: *osmotic synchronization*.

When the Ribocell reaches a stationary regime, at each division the genetic materials can be randomly distributed between the daughters. If the amount of genetic material is very low, then this can result in a separation of R_{P} from the other RNA strands. In fact, the Ribocell must contain a minimum genetic kit of three RNA filaments in order to be capable of self-replicating its entire genome: one R_{P} that catalyzes the RNA base pair transcription, one (R_{L} or _{c}R_{L}) and one (R_{P} or _{c}R_{P}) that work as templates for the transcription. Moreover, since R_{L} is necessary to catalyze lipid precursor conversion, the optimal minimum 3-ribozyme kit must be made up of 2R_{P} and one R_{L}. This minimum kit should be at least doubled before cell division, in order to have a chance that both daughters continue to be active. Therefore, if a random distribution of RNA filaments takes place after vesicle division, ribozyme segregation between the two daughters might occur. Different scenarios can be envisaged as sketched in Figure 2: *death by segregation* is reached if vesicles are produced without any ribozymes (*empty vesicles*) or containing one lone R_{P} or many filaments of _{c}R_{P} and/or _{c}R_{L} (*inert vesicles*). Vesicles that encapsulate R_{L} strands are *self-producing*: they are able to synthesize lipids and then can grow and divide producing in turn self-producing and/or empty vesicles. On the other hand, vesicles containing more than one molecule of R_{P} or both R_{P} and _{c}R_{P} filaments are able to self-replicate this reduced genome (*self-replicating genome vesicles*) but they cannot self-reproduce the membrane. So they are destined for an osmotic burst due to an unbalanced increase in waste concentration. Finally, a reduced version of the Ribocell consists in a lipid aggregate that contains one R_{P} filament and R_{L}/_{c}R_{L} strands. As a consequence of this, *reduced ribocells* are able to replicate the R_{L}/_{c}R_{L} genetic stuff, and at the same time to synthesize lipids. Therefore, they can grow and divide, producing in turn at least one reduced ribocell and/or self-replicating, inert and empty vesicle. With the help of stochastic simulations, we will try to explore all the possible scenarios.

## Results and discussion

### Deterministic analysis

In the present paper, we firstly explore the dependence of the stationary regime on the external concentrations of substrates for ribozymes 100 nucleotides long setting *k*_{L} to 1.7·10^{3}s^{-1}M^{-1}, i.e. the minimum value in the previously observed synchronization range. The aim of this preliminary deterministic study is to find the optimal initial conditions in order to achieve a stationary regime with the shortest life time. All the outcomes are reported in additional file 1.

As first, the dependence of Ribocell state on overall external concentration *C*_{T}^{E} is analyzed at the stationary regime, reached after 20 generations. Since [*P*_{ex}] = [*N*_{ex}] = 5.0·10^{-4}M, the overall external concentration can be approximated to *C*_{T}^{E}≈[*I*_{ex}]. The upper plots in Figure 3 show that when the overall external concentration [*I*_{ex}] increases, then the Ribocell radius *ρ*_{20} decreases, while the life cycle Δ*t*_{20} rises. Thus, vesicles become smaller and more dormant as the overall external concentration rises. This can be ascribed to the mechanism of synchronization itself and is in agreement with what we reported in a recent work (unpublished paper) where an inverse dependence of the vesicle steady size on overall external concentration was explicitly derived from the general stationary condition *γ* = 1. On the other hand, the observed increase in Ribocell life time is a direct consequence of the reduction in size, since a smaller membrane surface decreases the transport efficiency of substrates (lipid precursor and nucleotides) from outside. As a consequence of this, all metabolic processes slow down since they are sustained by the transport of external substrates. These two effects, i.e. the increase in lifetime and the slowdown of the metabolism, determine the linear rise in concentration of the overall genetic material, see the lower left plot of Figure 3. At the steady regime, the genome composition is almost independent of [*I*_{
ex
}], as shown by the lower right plot of

Figure 3. Two equal fractions of R_{L} and _{c}R_{L} strands are present, both around 33%, while the percentages for R_{P}, and _{c}R_{P} are lower: 24% and 10%, respectively. Due to the high *k*_{ss} value, the ribozymes are mainly present inthe form of dimers and this accounts for the equal fractions observed for the lipase ribozymes, while the percentage of R_{P} is greater than that of _{c}R_{P}, since some R_{P} strands are involved as catalyzers in template duplication. This also explains why the overall percentage of polymerase ribozymes (~34%) is lower than that of lipase ribozymes (~66%) in fact, not all polymerase ribozymes are available as templates for duplication. Moreover, this asymmetry is amplified as long as the total concentration of the genetic material increases, see data in additional file 1.

Setting [*I*_{ex}] = 0.3 M, we study the dependence of the stationary division regime on the external concentration of the substrates: lipid precursor [*P*_{ex}] and nucleotides. The same concentration value [*N*_{ex}] is set for the four different nucleotides since they have been assumed to have the same kinetic behavior. The upper plot of Figure 4 shows the opposite effects of [*N*_{ex}] and [*P*_{ex}] on the stationary vesicle radius *ρ*_{20}. Higher [*N*_{ex}] concentrations speed up genome self-replication with respect to lipid synthesis, accelerating waste production and leading to larger core volumes. Conversely, increasing [*P*_{ex}] reduces *ρ*_{20} since membrane self-reproduction becomes faster. For the same reason, the total concentration of genetic material is increased due to the high concentrations of nucleotides and the low concentrations of the lipid precursor, while, both substrate concentrations decrease cell life time when they are increased, since all the metabolic processes are accelerated. If [*N*_{ex}] ≥ 0.05 M, the Ribocell undergoes an osmotic burst since volume growth is too fast compared to lipid production for any value of [*P*_{ex}] in the studied range (see additional file 1).

In the upper plots of Figure 5, the time trends of the core volume on the left, the growth control coefficient *γ* and the reduced surface *ϕ* (on the right) are reported for a Ribocell starting with a number of dimers N_{0} = 100 for both R_{c}R_{L} and R_{c}R_{P}. These plots show that after a few generations the steady division regime is reached, as confirmed by *γ* that tends to 1.0. At the real beginning, the Ribocell undergoes a fast volume increase, as demonstrated by *γ* > 1.0 and *ϕ* < 1.0. This is due to the transport process of the lipid precursor and the nucleotides from the external environment that blows up the vesicle. In the lower plot of Figure 5, the division time is reported against the number of generations for ribocells starting with a different initial number of strands N_{0}. In all three cases, the same division time is reached after 10 generation:s, i.e. 68.2 days, that remains constant for the following generations. The higher N_{0}, the faster the cell division in the first generations.

Having determined optimal external conditions, the influence of ribozyme length is now investigated by keeping all the other kinetic parameters constant. Calculations have been performed changing in turn the length of R_{L} or R_{P}, and fixing the size of the other ribozyme to 100 bases, or changing the size of both R_{L} and R_{P} but keeping the same length. Results are reported in Figure 6. In all the studied cases, an increase in strand length determines at the stationary regime Ribocells with a longer life cycle Δ*t*_{25}, with a larger radius ρ_{25} and a lower RNA total concentration [RNA Strands]. The variations observed are quite small compared to those for 100-base ribozymes, except for the [RNA Strands] that show a change about 15%. Furthermore, the Ribocell shows to be much more sensitive to the change in size of the polymerase ribozyme R_{P} rather than R_{L}. This can be ascribed to the fact that, being longer, R_{P}, requires more time to self-replicate and this decreases the overall concentration of all the polymerase ribozymes and in turn the efficiency of genome self-replication and membrane self-reproduction.

Finally, Figure 7 displays the dependence of Δ*t*_{25} on the kinetic constants for RNA dimer formation *k*_{SS} and dissociation *k*_{S}. The plot clearly shows that the Ribocell life cycle at stationary regimes does not depend explicitly on the kinetic constant single values *k*_{SS} and *k*_{S} but on their ratio: *k*_{SS}/*k*_{S}, that is on the thermodynamic constant of RNA dimerization. The more thermodynamically stable the RNA dimers, the longer it takes to observe Ribocell self-reproduction. For instance, if *k*_{SS}/*k*_{S} is decreased by two orders of magnitude, the Ribocell life time reduces from 68.2 days to 11.8-6.4 days. The study of Ribocell time behavior approaching the stationary regime as a function of *k*_{SS} and *k*_{S} values would require a much deeper analysis that is beyond the scope of this paper.

### Stochastic simulations

Stochastic simulations were performed by means of the parallel version of ENVIRONMENT, running 32 statistically equivalent simulations of a 10-ribocell solution on different CPUs. Therefore, the outcomes were obtained as averages from a population of 320 vesicles. Kinetic parameters used for simulations are those reported in Table 1 while the initial conditions are shown in bold by additional file 1. At each cell division, only one of the two offspring was kept while the other was discarded in order to reduce computation time, thus keeping the number of monitored vesicles constant. This is in agreement with the assumption that the external concentrations of all substrates are fixed due to an incoming flux of material, i.e. the substrates cannot ever be exhausted.

On the left in Figure 8, the time evolutions of protocell populations obtained by stochastic simulations are reported for the three studied cases starting with a genome made up of 1, 10 and 100 dimers of both R_{c}R_{L} and R_{c}R_{P}. At the end of the simulations of all three cases, similar compositions of the protocell population are obtained with low percentages of real ribocells (3.3-6.7%) while the most populated fractions are those of empty (40.0-41.7%) self-producing (26.7-33.3%) and broken (18.3-25.0%) vesicles, respectively. Reduced ribocells are present only in the first generations since they very soon decay into self-producing and empty vesicles. Inert vesicles, i.e. vesicles entrapping free chains of _{c}R_{P} and/or _{c}R_{L} or a single R_{P}, are not formed and this can be ascribed to the high stability of RNA dimers and complexes so that the chance of finding free RNA monomers at the time of vesicle division is extremely improbable. Thus, the three studied cases are differentiated by their time behavior rather than by the final protocell population, as confirmed by the plots on the right in Figure 8, where the average division time <Δt_{n} > and the number of dividing protocells are reported against the generation number. In agreement with deterministic predictions, for the first generations the average division times <Δ*t*_{n} > are higher for ribocells starting with a lower initial number *N*_{0} of dimers although, in all cases, the deterministic Δ*t*_{n} (red triangles) are greater than the stochastic averages (black circles with error bars). This can be partially ascribed to the fact that the average <Δ*t*_{n} > is calculated on all the protocells that undergo the *n*-th division and only some of them are real ribocells. In fact, generation by generation, the protocell population is enriched by self-producing vesicles that can divide more quickly if a free R_{L} monomer is present and this lowers the average division time.

As an example, Figure 9 reports the time behavior of a single Ribocell with a starting genome made up of just one R_{c}R_{L} and one R_{c}R_{P}. In the upper plot, a comparison between the reduced surface time course determined deterministically and simulated stochastically is shown. As can be seen, the stochastic time trend presents a very irregular time behavior compared to the deterministic one that describes a highly synchronized oscillating regime of growth and division. In contrast, stochastic simulations highlight the alternation of dormant phases, where the reduced surface remains practically constant, both the core volume and the membrane surface being constant (data not shown), to very active steps where protocell growth takes place very fast, leading to a division event.

In order to account for this behavior, in Figure 10 the time course of the total number of lipase R_{L} and polymerase R_{P} ribozymes present as monomers, complementary strands, dimers and complexes, are reported against time, along with the number of free R_{L} and R_{P} monomers that exhibit catalytic activity. As can be seen, the fast growth and division step corresponds to the presence in the vesicle core of a free R_{L} chain while, in the dormant phase, ribozymes are all coupled in the form of dimers or complexes. In fact, during RNA template transcription in the first generation life time, the volume remains practically constant since the amount of waste molecules produced is not sufficient to promote a substantial water flux from the external environment. As a consequence, self-producing vesicles with a genome made up only of R_{L} monomers can reproduce very efficiently since no dormant phase can occur, given that the formation of R_{c}R_{L} dimers is impossible. This is what happens at high generation numbers in the protocell population time evolutions reported in Figure 8. It is clear in both cases with a starting genome *N*_{0} equal to 1 and 10. In fact, at high generation numbers, the only dividing protocells are self-producing vesicles that present free R_{L} monomers in the core volume. Although there are very few of these protocells, they can divide very efficiently, with a Δ*t* of around 0.81 days, and with division times that are very close to one another, so the population average <Δ*t*_{n} *>* seems very low with a small error bar as shown by the upper and middle plots on the left in Figure 8.

## Conclusions

In this paper, we applied an already published Ribocell *in silico* model [15, 16] to the case of two hypothetical ribozymes 100 nucleotides long by using the kinetic parameters reported in Table 2. The kinetic constant for the lipid formation, *k*_{L}, was set equal to 1.7E+3s^{-1}M^{-1}, the lowest value that exhibited a stationary regime of growth and division in a previous work [16] in which ribozymes 20 nucleotide long were assumed. Length of 100 nucleotides was chosen as a compromise between the need to reduce calculation times and the choice of a plausible ribozyme size, keeping in mind that the RNA subunits in ribosomes devoted to protein elongation have a comparable length [18, 19].

By means of deterministic analysis, the robustness of the stationary regime was also investigated as a function of the initial conditions, the length of ribozymes and the kinetic constants of the RNA dimerization. For 100-base long ribozymes, the best experimental conditions in terms of the external concentrations have been found: [*N*_{ex}] = [*P*_{ex}] = 1.0E-2M and [*I*_{ex}] = 0.3M in order to observe a stationary regime with the lowest division time: Δ*t*_{20} = 68.3 days. A so high protocell life time can be mainly ascribed to the RNA reversible association that is shifted towards the dimer formation rendering the concentration of the catalyzers R_{L} and R_{P} very low. This has been confirmed by analyzing the dependence of the life cycle on the thermodynamic constant of ribozyme dimerization. A small influence on the stationary regime was observed on changing the length of the RNA strands: an increase in filament size determines higher life times and a lower amount of genetic material. This effect is more pronounced when the R_{P} length changes. On the other hand, the external concentrations of substrates and inert compounds appear to highly affect the stationary regime in terms of vesicle size, genetic material amount and genome composition. Moreover, deterministic calculations have also shown that the stationary regime can be reached from very different initial genome composition, although when too much free R_{L} ribozymes are present at the beginning the death for ribozymes segregation can be observed since the protocell divides too quickly before the genome replication. Stochastic simulations have been done starting from a population of 320 identical ribocells with an initial genome composed by *N*_{0} = 1,10 and 100 dimers of both R_{c}R_{L} and R_{c}R_{P}. The analysis of simulations outcomes shows that the ribocell time behaviors is highly influenced by random fluctuations. Since the genetic material is randomly distributed at each cell division, this can produce different type of protocells, ranging from empty vesicles to genuine ribocells, their internal metabolism being highly influenced by the presence of the catalytic RNA strands. In fact, deterministic analysis cannot take into account the disappearance of ribozymes due to a vesicle division, since this approach simply halves the genetic amount and follows the reacting molecule time courses in terms of population averages, i.e. real positive numbers that can be less than one without being zero. On the other hand, the stochastic simulations are more realistic to the random loss of ribozymes from the genome being capable of describing a population of protocells with completely different time behaviors. As a consequence, the simulation outcomes show that ribocells are not enough robust to survive to random fluctuations. In fact only about the 5% of the initial population survive as genuine ribocells after 15-25 generations and on a longer time window they are destined for extinction. Furthermore, the time course of each single protocell is also greatly influenced by intrinsic stochasticity in particular by the time fluctuations of the RNA dimer dissociation. In fact, when all the RNA strands are associated in dimers, protocells remain in a lazy phase, whereas free R_{L} monomers induce fast growth and division steps and free R_{P} cause the fast RNA replication without changing the vesicle size appreciably. Therefore these two process are synchronized only by chance and this also represents a reason of weakness of this model protocell.

In order to implement experimentally ribozymes-based minimal cells two main improvements are necessary. As first, more free monomers of both R_{L} and R_{P} must be available in the vesicle core so that the ribocell life cycle will be speeded up and the division time lowered. This can be achieved by increasing the working temperature since it has been recently show that fatty acid vesicles are stable up to 90°C [33] and the efficiency of the self-catalyzed replication of RNA strands increases with a temperature rise. This is also in agreement with our results since the *k*_{SS} value used can be considered as the appropriate 100-base long RNA association constant for a higher temperature than 25°C [34]. For a more detailed theoretical analysis at high temperature, it is necessary, of course, to estimate the kinetic constants for all the steps involved in the internal metabolism. This will be the topic of a future work. The second necessary improvement consists in finding a way to really synchronize the genome self-replication and the membrane reproduction. This is a much more complex task to achieve. Working with high concentration of the genetic material, it can avoid, or at least reduce, the ribozymes segregation and this should be compatible with a high working temperature. The best strategy could be to have a fine control of the R_{c}R_{L} dissociation since when free R_{L} monomers are present in the aqueous core the membrane growth quickly and the division takes place very soon. Thus the R_{c}R_{L} dissociation can act as a trigger for the membrane growth and division.

Finally rephrasing the George Box famous sentence, we are aware that this *in silico* Ribocell model is in a some way wrong, but we hope it might inspire researchers involved in the lab implementation of the ribozymes-based minimal cell.

## References

Luisi PL, Stano P:

*The Minimal Cell*. Dordrecht, Springer; 2011.Noireaux V, Maeda YT, Libchaber A:

**Development of an artificial cell, from self-organization to computation and self-reproduction.***PNAS*2011,**108:**3473. 10.1073/pnas.1017075108Chen IA, Walde P:

**From Self-Assembled Vesicles to Protocells.***Cold Spring HarbPerspectBiol*2010,**2:**a002170.Luisi PL, Ferri F, Stano P:

**Approaches to semi-synthetic minimal cells: a review.***Naturwissenschaften*2006,**93:**1–13. 10.1007/s00114-005-0056-zDeamer D:

**A giant step towards artificial life?***Trends in Biotechnology*2005,**23:**336–338. 10.1016/j.tibtech.2005.05.008Fry I:

**The Role of Natural Selection in the Origin of Life.***Orig Life EvolBiosph*2011,**41:**3–16. 10.1007/s11084-010-9214-1Luisi PL:

**Chemical aspects of synthetic biology.***Chemistry & Biodiversity*2007,**4:**603–621. 10.1002/cbdv.200790053Pohorille A, Deamer D:

**Artificial cells: prospects for biotechnology.***Trends in Biotechnology*2002,**20:**123–128. 10.1016/S0167-7799(02)01909-1Tarahovsky YS:

**"Smart" liposomal nanocontainers in biology and medicine.***Biochemistry (Mosc)*2010,**75:**811–824. 10.1134/S0006297910070023Joyce G:

**Foreward.**In*Origins of life: The central concepts*. Edited by: Deamer DW, Fleischaker GR. Boston, Jones and Bartlett Publishers; 1994:xi-xii.Luisi PL:

**Autopoiesis: a review and a reappraisal.***Naturwissensch*2003,**90:**49–59.Luisi PL:

**About various definitions of life.***Orig Life EvolBiosph*1998,**28:**613–622. 10.1023/A:1006517315105Szostak JW, Bartel DP, Luisi PL:

**Synthesizing life.***Nature*2001,**409:**387–390. 10.1038/35053176Schrum JP, Zhu TF, Szostak JW:

**The Origins of Cellular Life.***Cold Spring Harbor PerspectBiol*2010,**2:**a002212. 10.1101/cshperspect.a002212Mavelli F, Della Gatta P, Cassidei G, Luisi PL:

**Could the Ribocell be a Feasible Proto-cell Model?**In*Proceedings of Workshop OQOL' 09: Open Questions on the Origins of Life:20–23 May 2009: San Sebastian*Edited by: Ruiz-Mirazo K, Luisi PL. 2010,**40**(Special Issue):459–464. Orig Life Evol Biosph Orig Life Evol BiosphMavelli F:

**Theoretical Approaches to Ribocell Modeling.**In*Minimal Cell*. Edited by: Luisi PL, Stano P, Dordrecht. Springer; 2011:255–273.Mavelli F, K Ruiz-Mirazo K:

**Stochastic simulations of minimal self-reproducing cellular systems.***Phil Trans Royal Soc London Series B*2007,**362:**1789–1802. 10.1098/rstb.2007.2071Smith TF, Lee JC, Gutell RR, Hartman H:

**The origin and evolution of the ribosome.***Biol Direct*2008,**3:**16–29. 10.1186/1745-6150-3-16Agmon I:

**The Dimeric Proto-Ribosome:Structural Details and Possible Implications on the Origin of Life.***Int J Mol Sci*2009,**10:**2921–2934. 10.3390/ijms10072921Chizmadzhew YA, Maestro M, Mavelli F:

**A simplified kinetic-model for an autopoietic synthesis of micelles.***ChemPhysLett*1995,**226:**56–62.Mavelli F, Luisi PL:

**Autopoietic self-reproducing vesicles: a simplified kinetic model.***J Phys Chem*1997,**41:**16600–16607.Mavelli F, Piotto S:

**Stochastic simulations of homogeneous chemically reacting systems.***J Mol Struct: THEOCHEM*2006,**771:**55–64. 10.1016/j.theochem.2006.03.016Gillespie DT:

**Stochastic simulation of chemical kinetics.***AnnuRevPhysChem*2007,**58:**35–55.Mavelli F, Maestro M:

**Stochastic simulations of micellization kinetics.***J Chem Phys*1999,**111:**4310–4318. 10.1063/1.479729Mavelli F, Stano P:

**Kinetic models for autopoietic chemical systems: the role of fluctuations in a homeostatic regime.***Phys Biol*2010,**7:**16010. 10.1088/1478-3975/7/1/016010Mavelli F, Ruiz-Mirazo K:

**ENVIRONMENT: a computational platform to stochastically simulate reacting and self-reproducing lipid compartments.***Phys Biol*2010,**7:**36002. 10.1088/1478-3975/7/3/036002Chen IA, Roberts RW, Szostak JW:

**The emergence of competition between model protocells.***Science*2004,**305:**1474–1476. 10.1126/science.1100757Christensen U:

**Thermodynamic and kinetic characterization of duplex formation between 2'-O, 4'-C-Methylene-modified oligoribonucleotides, DNA and RNA.***Bioscience Reports*2007,**27:**327–333. 10.1007/s10540-007-9056-xTsoi PY, Yang M:

**Kinetic study of various binding modes between human DNA polymerase β and different DNA substrates by surface-plasmon-resonance biosensor.***Biochem J*2002,**361:**317–325. 10.1042/0264-6021:3610317Stage-Zimmermann TK, Uhlenbeck OC:

**Hammerhead ribozyme kinetics.***RNA*1998,**4:**875. 10.1017/S1355838298980876Mansy SS, Schrum JP, Krishnamurthy M, Tobé S, Treco DA, Szostak JW:

**Template-directed synthesis of a genetic polymer in a model protocell.***Nature*2008,**454:**122-U10. 10.1038/nature07018Sacerdote MG, Szostak JW:

**Semipermeable lipid bilayers exhibit diastereoselectivity favoring ribose.***PNAS*2005,**102:**6004–6008. 10.1073/pnas.0408440102Mansy SS, Szostak JW:

**Thermostability of model protocell membranes.***PNAS*2008,**105:**13351–13355. 10.1073/pnas.0805086105Lincoln TA, Joyce GF:

**Self-Sustained Replication of an RNA Enzyme.***Science*2009,**323:**1229–1232. 10.1126/science.1167856

## Acknowledgements

This work has been funded by the MIUR (PRIN2008FY7RJ4_002). Parallel calculations were done at CINECA Super Computing Italian Centre thanks to the approved ISCRAC project (HP10CVJLGZ). The authors would like to thank Anthony Green for his careful review of the English in this manuscript.

This article has been published as part of *BMC Bioinformatics* Volume 13 Supplement 4, 2012: Italian Bioinformatics Society (BITS): Annual Meeting 2011. The full contents of the supplement are available online at http://www.biomedcentral.com/1471-2105/13/S4.

## Author information

### Authors and Affiliations

### Corresponding author

## Additional information

### Competing interests

The authors declare that they have no competing interests.

## Electronic supplementary material

### 12859_2012_5107_MOESM1_ESM.docx

Additional file 1: Deterministic Outcomes of the Ribocell time behavior: stationary values for different initial conditions (DOCX 22 KB)

## Rights and permissions

This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

## About this article

### Cite this article

Mavelli, F. Stochastic simulations of minimal cells: the Ribocell model.
*BMC Bioinformatics* **13**
(Suppl 4), S10 (2012). https://doi.org/10.1186/1471-2105-13-S4-S10

Published:

DOI: https://doi.org/10.1186/1471-2105-13-S4-S10