- Research article
- Open Access
Stochastic sequence-level model of coupled transcription and translation in prokaryotes
© Mäkelä et al; licensee BioMed Central Ltd. 2011
- Received: 19 November 2010
- Accepted: 26 April 2011
- Published: 26 April 2011
In prokaryotes, transcription and translation are dynamically coupled, as the latter starts before the former is complete. Also, from one transcript, several translation events occur in parallel. To study how events in transcription elongation affect translation elongation and fluctuations in protein levels, we propose a delayed stochastic model of prokaryotic transcription and translation at the nucleotide and codon level that includes the promoter open complex formation and alternative pathways to elongation, namely pausing, arrests, editing, pyrophosphorolysis, RNA polymerase traffic, and premature termination. Stepwise translation can start after the ribosome binding site is formed and accounts for variable codon translation rates, ribosome traffic, back-translocation, drop-off, and trans-translation.
First, we show that the model accurately matches measurements of sequence-dependent translation elongation dynamics. Next, we characterize the degree of coupling between fluctuations in RNA and protein levels, and its dependence on the rates of transcription and translation initiation. Finally, modeling sequence-specific transcriptional pauses, we find that these affect protein noise levels.
For parameter values within realistic intervals, transcription and translation are found to be tightly coupled in Escherichia coli, as the noise in protein levels is mostly determined by the underlying noise in RNA levels. Sequence-dependent events in transcription elongation, e.g. pauses, are found to cause tangible effects in the degree of fluctuations in protein levels.
- Translation Initiation
- Synonymous Codon
- Ribosome Binding Site
- Transcription Elongation
- Stochastic Simulation Algorithm
In prokaryotes, both transcription and translation are stochastic, multi-stepped processes that involve many components and chemical interactions. Several events in transcription and in translation [1–8] are probabilistic in nature, and their kinetics are sequence dependent. One example is sequence-dependent transcriptional pausing . When they occur, these events can affect the degree of fluctuations of RNA and protein levels. Since noise in gene expression affects cellular phenotype, sequence dependent noise sources are subject to selection [9, 10] and are thus evolvable . Recent evidence suggests that these noise sources may be key for bacterial adaptability in unpredictable or fluctuating environmental conditions [11, 12].
To better understand the evolvability of bacteria, it is important to understand how fluctuations in RNA levels propagate to protein levels. Transcription and translation are coupled in prokaryotes, in that translation can initiate after the formation of the ribosome binding site region of the RNA, which occurs during the initial stages of transcription elongation. The extent to which sequence-dependent events in transcription elongation affect the noise in RNA, and consequently protein levels is largely unknown. Due to this, it is also not yet well understood how phenotypic diversity is regulated in monoclonal bacterial populations.
Two recent experiments have given a preliminary glimpse at the dynamics of production of individual proteins  and RNA molecules in vivo in bacteria. However, as of yet, there is no experimental setting to simultaneously observe the production of both RNA and proteins at the molecular level. Further, in the aforementioned experiments [13, 14], the rate of gene expression was kept very weak, as otherwise the number of molecules would not be easily quantifiable. This implies that they cannot be used to study the effects of events such as the promoter open complex formation . The present shortcomings of these techniques enhance the need for realistic models of gene expression in prokaryotes.
Several measurements have shed light on the dynamics of transcription and translation elongation [16, 17], and revealed the occurrence of several stochastic events during these processes, such as transcriptional pauses [2, 4]. The kinetics of RNA and protein degradation are also better known . These measurements allowed the recent development of realistic kinetic models of transcription at the nucleotide level [5, 19] and translation at the codon level . These models were shown to match the measurements of RNA production at the molecule level [6, 21] and of translation elongation dynamics at the codon level . In this regard, it was shown that measurements of sequence dependent translation rates of synonymous codons could be modeled with neither deterministic nor uniform stochastic models , thus the need for models with explicit translation elongation. Similarly, transcription elongation also needs to be modeled explicitly to accurately capture the fluctuations in RNA levels for fast transcription initiation rates [5, 19, 22].
Here, we propose a model of transcription and translation at the nucleotide and codon level for Escherichia coli. The model of transcription is the same as in , and includes the promoter occupancy time, transcriptional pausing, arrests, editing, premature termination, pyrophosphorolysis, and accounts for the RNAp footprint in the DNA template. The model of translation at the codon level proposed here is based on the codon-dependent translation model proposed in , which includes translation initiation, codon-specific translation rates and the stepwise translation elongation and activation. The model also accounts for the ribosome's footprint in the RNA template as well as the occupancy time of the ribosome binding site. Here, beside these features, we further include the processes of back-translocation, drop-off, and trans-translation. Finally, we include protein folding and activation, as well as degradation, modeled as first-order processes, so as to study fluctuations in the protein levels.
The dynamics of the model follow the Delayed Stochastic Simulation Algorithm [19, 23] and is simulated by a modified version of SGNSim . While the most relevant innovation is the coupling between realistic stochastic models of transcription and translation at the nucleotide and codon levels, which allows the study of previously unaddressed aspects of the dynamics of gene expression in prokaryotes, this introduces a level of complexity that required simulation capabilities that SGNSim did not possess. Namely, the simulator is required to create and destroy compartments at run time within the reaction vessel, where a separate set of reactions can occur.
We start by validating the dynamics of translation elongation in the model. Next, using realistic parameter values extracted from measurements, we address the following questions: how different are the distributions of time intervals between translation initiation events and between translation completion events, i.e., how stochastic is translation elongation? To what extent do fluctuations in temporal RNA levels propagate to temporal protein levels, and what physical parameters control this propagation of noise between the two? Finally, we investigate whether transcriptional pauses have a significant effect on the dynamics of protein levels.
Dynamics of transcript production
Given the number of chemical reactions per nucleotide in the model and that one gene can have thousands of nucleotides, the dynamics are considerably complex. To illustrate this, we show examples of the kinetics of multiple RNAps on a DNA strand within a short time interval, and the dynamics of multiple ribosomes on one of the RNA strands as it is transcribed. Parameter values were obtained from measurements in E. coli for LacZ (see methods section), since the dynamics of transcription and translation have been extensively studied for this gene. LacZ has 3072 nucleotides and its transcription is controlled by the lac operon.
Reactions modeling transcription
Initiation and promoter complex formation (1)
k init = 0.015
τ oc = 40 ± 4
Promoter clearance (2)
k m = 114
k m = 114
k a = 114, n>10,
k a = 30, n≤10
k p = 0.55
τ p = 3
Pause release due to collision (6)
k m = 114
Pause induced by collision (7)
k m = 114
k ar = 0.00028
τ ar = 100
k ec = 0.008
τ c = 5
Premature termination (10)
k pre = 0.00019
k pyro = 0.75
k f = 2
mRNA degradation (13)
k dr = 0.011
Figure 1B shows the distribution of the time intervals between transcription initiation events, which is Gaussian-like, due to the open complex formation step. The longer tail on the right side of the distribution is mainly due to the contribution of the time it takes for the RNAp to bind to the template, a bimolecular reaction whose expected time to occur follows an exponential distribution with a mean of 2.5 s [26, 27].
Figure 1C shows the distribution of time intervals between transcription completion events in the same simulation as Figure 1B. This distribution is strikingly different from that of Figure 1B due to the stochastic events in transcription elongation. Pauses, arrests and other stochastic events cause the distribution to be bimodal due to the bursty dynamics (many short intervals and some long intervals). When these probabilistic events occur to some RNAp molecules, they significantly alter the distances in the strand between consecutive RNAps. For example, when one RNAp pauses, its distance to the preceding RNAp increases, while the distance to subsequent RNAps shortens, allowing completion events to be separated by intervals shorter than the promoter delay.
Dynamics of production of proteins
Figure 2B shows the distribution of intervals between translation initiation events. Since there is no significant delay in translation initiation (as the one due to the promoter open complex formation), this distribution is exponential-like. Figure 2C shows the corresponding distribution of intervals between translation completion events (grey bars), given the presence of a sequence dependent arrest site at nucleotide 1850. This distribution, while resembling that of Figure 2B, shows more short time intervals, due to the long arrest in transcription elongation. For comparison, we also show a distribution of intervals between translation completion events drawn from cases without the sequence dependent arrest in transcription (solid black line). The difference between the two distributions illustrates how events in transcription elongation (e.g. a sequence dependent arrest site) can significantly affect the dynamics of translation.
Comparing the dynamics of the model of translation with measurements
Recently, the real-time expression of a lac promoter was directly monitored in E. coli with single-protein resolution . The proteins were found to be produced in bursts (i.e. several proteins being produced from each RNA), with the distribution of intervals between bursts fitting an exponential distribution, while the number of proteins per burst followed a geometric distribution . These distributions were measured for a gene that was kept strongly repressed and for which the ribosome binding site (RBS) was engineered so that translation was also very weak . Under these conditions, our model reproduces these dynamics (data not shown). Nevertheless, we note that it is possible to match these measurements with a simpler model than the one proposed here, where transcription and translation are modeled as single step events [21, 23].
We next compare the kinetics of translation in our model with measurements of the translation elongation speed in three engineered E. coli strains designed to enhance queue formation and traffic in translation . Each strain contains a different mutant of LacZ. The pMAS23 strain corresponds to the wild-type lacZ. The other two sequences differ in that a region of slow-to-translate codons was inserted (~24 in pMAS-24GAG and ~48 in pMAS-48GAG). The speed of protein chain elongation was measured by subjecting the cells to a pulse of radioactive methionines, and then measuring the level of radioactivity in cells of each population, every 10 s after the pulse. Each strand contained 23 methionines, spread out unevenly on the DNA sequence, causing the incorporation curve to be non-linear.
Given that they differ in the nucleotide sequence, it was hypothesized that the translation elongation speed of the three strands would differ, as the speed of incorporation of an amino acid depends on which synonymous codon is coding for it . The cells where translation is faster will thus be expected to have higher levels of radioactivity in the translated proteins, as more labeled amino acids have been incorporated in a fixed time interval. If the translation speeds of the three strands were identical, they would exhibit identical levels of radioactivity at the same point in time.
Propagation of fluctuations in RNA levels to protein levels
We simulate the model for varying effective rates of transcription initiation (denoted keff). This rate is determined by the basal rate of transcription initiation (kinit), which sets the binding affinity of the RNAp to the transcription start site, and by the strength of repression of transcription. Thus, to vary keff, we vary the number of repressor molecules present in the system. Three sets of simulations are performed, differing in rate of translation initiation (ktr). This rate is one of the kinetic parameters of the model, thus can be changed directly, and not by indirect means as keff. In E. coli genes, this rate is believed to be determined by the RBS sequence . mRNA and protein degradation rates are set so that the mRNA and protein mean levels are identical for all cases, allowing us to study how the level of noise in mRNA and protein levels changes.
For each set of values of keff and ktr we perform 100 independent simulations. Depending on these rates, the mean time to reach steady state differs. Each case is simulated for long enough to reach steady state and for an additional 100 000 s after that. The time series of the 100 simulations for each set of parameter values is concatenated into one time series, from which the noise is quantified by the square of the coefficient of variation, CV2 (variance over the mean squared) . This number of long simulations is necessary to properly sample the system due to the stochasticity of the underlying processes.
In general, we find that increasing keff decreases the noise in protein levels due to the decrease of noise in mRNA levels. Increasing ktr increases the noise in protein levels, due to the increased size of the bursts in the protein level [8, 29]. This finding has not yet been experimentally validated by direct means.
An interesting observation from Figures 4 and 5 is that, for keff < 5 × 10-4 s-1, as keff is increased, the noise in protein levels decreases significantly, while the noise in RNA levels does not noticeably change. This is due to the decrease in mean protein burst size, i.e., the mean number of proteins produced from each RNA molecule, as both keff and the degradation rate of RNA molecules are varied.
The correlation value is largely determined by the rates of mRNA and protein degradation and production. For example, both increasing the mRNA degradation rate and/or decreasing the protein degradation rate increases the time averaging constant of the mRNA fluctuations, and thus decreases the correlation between mRNA and protein levels. In general, if the mean mRNA and protein levels and kept unchanged by tuning their degradation rates accordingly, the correlation between RNA and protein time series can be increased by lowering the mRNA production rate and/or increasing the protein production rate.
Effects of transcriptional pauses on the fluctuations in protein levels
Recent work  reported that long transcriptional pauses enhance the noise in mRNA levels. We next investigate to what extent the fluctuations in RNA levels caused by long transcriptional pauses propagate to protein levels. Long sequence-dependent pauses [16, 30, 31] in transcription elongation may cause the ribosome to stall in the mRNA chain. This will likely cause subsequent ribosomes to accumulate in the preceding sequence. When the RNAp is spontaneously released from the pause , translation of the stalled ribosomes likely resumes but the distribution of intervals between them will differ significantly from what it would have been without the pause event. Consequently, the protein production is likely to become burstier, especially if the long pause site is located near the end of the sequence. An increase in burstiness ought to increase the noise in protein levels.
To verify this, we perform two simulations. We introduce a long-pause sequence with mean pause durations of 500 s in one case, and 100 s in the other (both values are within realistic intervals ). In both cases, we set the probability that an RNAp will pause at that site to 70% (identical to the value for his pause sites ).
Measuring the protein noise levels, we find that the CV2 is ~5% higher for the 100 s pause site and ~10% higher for the 500 s pause site, in comparison to the same sequence without any sequence specific long-pause site. These relative differences can be biologically relevant in that such a change may, in some cases, cause the degree of phenotypic diversity of a monoclonal cell population to change.
The effects of several pause sites on the same strain are cumulative, namely, the higher the number of pause sites, the higher the noise in RNA levels . Combined with the present results, this leads us to the conclusion that the sequence-dependent transcriptional pausing mechanism likely exists to allow a wide variation of both RNA and protein noise levels.
We proposed a new delayed stochastic model of prokaryotic transcription and translation at the single nucleotide and codon level, where the processes of transcription and translation are dynamically coupled in that translation can initiate immediately upon the formation of the ribosome binding site region of the nascent mRNA. Simulations of the model's dynamics show that, within realistic parameter values, the protein noise levels are determined, to a great extent, by the fluctuations in the RNA levels, rather than from sources in translation, in agreement with indirect measurements , as translation elongation was found to be less stochastic than transcription elongation. Specifically, the distributions of intervals between translation initiation and translation completion events only differ significantly if the sequence possesses long sequence-dependent pauses or clusters of slow-to-translate codons. The sequence dependence of several mechanisms that can act as generators of strong fluctuations in RNA levels , the propagation of these fluctuations to protein levels, and the ability of fluctuations in protein levels to affect cellular phenotype , suggest that these mechanisms may be evolvable.
As a previous study has suggested , the translation initiation rate was found to be key in determining the degree of coupling between the fluctuations in RNA and protein levels, if one assumes that the degradation rate of the proteins is changed accordingly to maintain their mean level unchanged. Varying this sequence-dependent, and thus, evolvable parameter  within realistic ranges gave a widely varying degree of coupling between the fluctuations in RNA and protein levels. It is therefore not necessarily true that noisy production of RNA molecules results in noisy protein levels. Interestingly, while decreasing the coupling between transcription and translation by decreasing the rate of translation initiation causes the protein levels to become less noisy, it also takes longer for a change in RNA levels to be followed by the protein levels. This suggests that to be able to change rapidly in response to, e.g., environmental changes, the levels of a protein will be necessarily noisier.
Confirming previous studies [1, 5, 8, 19], we found that the distributions of time intervals between transcription initiation and completion events differ significantly and that the faster the rate of transcription initiation events, the more they differ. This implies that in the regime of fast transcription, both the transcription and translation elongation processes need to be modeled explicitly and coupled, if one is to match the mean and fluctuations in the protein levels at the molecular level. This is of relevance, since bursts in protein levels may trigger many processes, such as phenotypic differentiation [33, 34]. A final justification for using the model proposed here is the complexity of the process of gene expression in E. coli, and the fact that many events therein may or may not affect the temporal RNA and protein levels significantly, depending on their specific sequence-dependent features. Such effects, due to the complexity of the system, are not easily predictable without performing explicit numerical simulations.
The model proposed here includes several features not included in previous models such as a gradual degradation event that can be triggered while the RNA is still being transcribed. As its parameter values were extracted from measurements, it should be useful in the study of several aspects of the dynamics of gene expression in prokaryotes that cannot yet be measured directly and to explore the state space of gene expression dynamics by varying any of the physical variables within realistic ranges.
However, the present model does not yet account for known effects of ribosomes on the dynamics of transcription elongation. These might need to be included in future developments of the proposed model as recent results [27, 35] suggest that the rate of translation elongation can affect the rate of transcription elongation, due to possible interactions between the ribosome that first binds to the mRNA and the RNAp transcribing it. Possible effects may include facilitating the release of paused RNAp's, which could affect the degree of the contribution of pauses to the noise in RNA and thus protein levels. We do not exclude the possibility that the contrary may occur in specific cases, that is, that the paused state of the RNAp may cause pauses in the ribosome translational dynamics, which would amplify the effect of transcriptional pauses on the fluctuations of protein levels. Whether the pause is ubiquitous or due to loop formations in the nascent RNA may affect the results of the interaction as well. Provided experimental evidence on the nature and consequences of these interactions, once included in the model, we may be able to test, among other things, whether long transcriptional pauses located in an attenuator system provide an additional layer of control over premature transcription terminations, and thus over RNA and protein noise levels.
Model of transcription, one nucleotide at a time
We model the dynamics of gene expression as in . This model was shown  to match the dynamics of RNA and protein production at the single molecule level . The dynamics of the system of chemical reactions is driven by the delayed stochastic simulation algorithm (delayed SSA ) so as to include events whose time of completion once initiated is non negligible, in that it affects the dynamics of production of RNA and protein molecules. Specifically, several steps in gene expression, such as the promoter open complex formation, are time consuming . To include these events when simulating gene expression, the delayed SSA was proposed .
All simulations are executed by an extended version of SGNSim  to allow multiple coupled chain elongation processes to run in parallel on each elongating RNA strand. The extension consists in providing the simulator with the ability to introduce new chemical reactions at run time (that is, those corresponding to the translation of each individual RNA strand).
The delayed stochastic model of transcription at the nucleotide level  includes the promoter occupancy time, pausing, arrests, editing, premature terminations, pyrophosphorolysis, and accounts for the RNAp footprint in the DNA template . Additional reactions model the stepwise forward movement and activation of the RNAp, pausing and unpausing of the RNAp due to collisions with adjacent RNAps, release of the promoter when the RNAp begins elongation, and error correction.
The reactions, stochastic rate constants and time delays, are shown in Table 1 and described in detail in [5, 37–41]. Here, Pro stands for the promoter region, RNAp for the RNA polymerase, and RNAp·Pro for the promoter region occupied by an RNAp. A n , O n and U n stand for the n th nucleotide when activated, occupied, and unoccupied, respectively. Ranges of nucleotides are denoted such as U[start, end], denoting a stretch of unoccupied nucleotides from indexes start to end. , and are used to represent a paused, arrested, or error correcting RNAp at position n. On the template, each RNAp occupies (2ΔRNAp+1) nucleotides, where ΔRNAp = 12. These nucleotides cannot be occupied by any other RNAp at the same time. denotes transcribed ribonucleotides which are free (i.e., not under the RNAp's footprint). These transcribed ribonucleotides are created in a separate part of the simulation (denoted by the R superscript), one separate set per RNA strand, so that we can simulate the translation of all individual RNA molecules independently and simultaneously.
We use a delayed reaction event to model the first step in transcription, the promoter closed and open complex formation (1). These processes could instead be modeled by a set of non-delayed, consecutive, reactions . We use a delayed reaction as it was shown to accurately model the dynamics of this process [19, 21, 23]. The duration of this step likely varies from one event to the next, but while values for the mean duration are known, as of yet, there are no exact measurements of the standard deviation. Nevertheless, it is likely small compared to the mean, given the very small standard deviations of promoter activity . For these reasons, we set the promoter delay, τ oc , as a random variable, following a normal distribution with a mean of 40 s and a standard deviation of 4 s, whose value is randomly drawn each time a transcription event occurs.
Once the first nucleotide is occupied via reaction (2), stepwise elongation can begin (3). Also, as soon as the promoter is released, a new transcription initiation event can occur. Following each elongation step (3), an activation step occurs (4), which is necessary for the RNAp to move along the template to the next nucleotide. The following events compete with stepwise elongation: pausing (5) and (7), released via (5) or (6), arrests and their release (8), editing (9), premature terminations (10), and pyrophosphorolysis (11).
At the end of the elongation process, the RNAp is released (12). mRNA degradation is modeled, for simplicity, as a first order reaction (13). When (13) occurs, the first few ribonucleotides of the RNA are immediately removed from the system, preventing any new translation event . Thus, we model the degradation process such that it begins in the vicinity of the RBS and then gradually cuts the mRNA as it is being released from the ribosomes. This allows the translating ribosomes to complete protein production before the whole mRNA is degraded. When the final ribosome unbinds from the RNA, the rest of the RNA strand, denoted by R in reaction (13), is destroyed.
If the model of RNA degradation was such that some of the ribosomes on the RNA template fell off when degradation begins (i.e. due to endonucleatic cleavage of the RNA chain at a random position ), one consequence would be the reduction of the mean protein burst size as these RNAs would contribute far fewer proteins than if the ribosomes were allowed to finish translating. This would likely result in a reduction of protein noise levels. Alternatively, the ribosome occupancy of the ribosome binding site might determine mRNA longevity . In this case, for the same mean burst size, the noise is expected to increase since large bursts will get larger and small bursts will get smaller, likely increasing protein noise levels. We opted not to include these additions to the degradation model since they are not yet well characterized .
For simplicity, we opted not to include this reaction in the simulations, and instead set a value for the rate of transcription initiation that matches realistic rates of RNA production. From the point of view of RNA production, since (2b) competes with reaction (2), it would be dynamically equivalent to decrease the rate of transcription initiation in (2) to account for the fraction of abortive initiations.
Model of translation, one codon at a time
Reactions modeling translation
k trans_init = 0.33
Stepwise translocation (15-17)
k tm = 1000
k transA = 35, k transB = 8,
k transC = 4.5
k bt = 1.5
k drop = 0.000114
k tt = 0.000052
Elongation completion (22)
k trans_f = 2
Folding and activation (23)
k fold = 0.0024
Protein degradation (24)
k dec = 0.0017
Translation has three main phases: initiation, elongation and termination. It begins with the binding of the ribosome complex to the mRNA strand. During elongation, the amino acids, determined by the RNA sequence, are added to the elongating peptide chain. Termination is the final step, as specific release factors detach the peptide and the RNA chain from the ribosome. E. coli has specific translation factors for each phase: initiation factors IF1, IF2 and IF3, elongation factors EF-G, EF-Tu and EF-Ts and three release factors RF1, RF2 and RF3 . These are not explicitly modeled, as they exist in abundance under normal conditions.
The binding of the ribosome to the ribosome binding site (RBS) of the RNA starts with the binding of the 30S ribosomal subunit to the nascent mRNA. After that, fMet-tRNA binds to the P-site forming a 30S complex. The 50S ribosome subunit attaches to it, forming the 70S initiation complex . This process is modeled as a single step reaction (14). The next ribosome can only to bind after the preceding one has moved away from the RBS. This implies that the initiation of two consecutive translation events is separated by a non-negligible time interval.
Translation elongation occurs through successive translocation-and-pause cycles . Translocation includes three steps (15-17), after which there is a pause (18), during which the bond between amino acids is formed. The time that (18) takes to occur accounts for this pause, which is much longer than the time for (15-17) to occur .
The genetic code contains two mechanisms for redundancy: some tRNAs can be charged with the same amino acid, and a single tRNA can recognize more than one codon due to a "wobble" effect in position three of the anti-codon . The net effect is that multiple codons code for the same amino acid. These codons are called synonymous codons. Synonymous codons read by the same tRNA have been shown to translate at significantly different rates , implying that our model must incorporate per-codon translation rates for reaction (18), rather than per-tRNA or per-amino acid rates. Only a few of these translation rates have been measured directly  but indirect assessment is available . In our case, we assume normal cellular conditions, including an abundance of charged tRNA, implying that we do not need to model the tRNA explicitly.
Since each codon is translated at a different rate, the codon frequency also needs to be accounted for explicitly . In the model, the sequence can either be randomly generated or selected from a known gene. In the former case, the sequence is randomly generated according to the known statistical frequency of each codon in E. coli.
The competing reactions of stepwise translation elongation are back-translocation (19), drop-off (20) and trans-translation (21), which are explicitly modeled. Back-translocation generally occurs when the tRNA has not yet locked into the peptide chain, causing the ribosome to move backwards on the mRNA template to the position of the previous codon. While the occurrence of back-translocation has been observed and can be promoted by certain antibiotics [50–52], its exact causes remain somewhat unknown. Nevertheless, the kinetic rates for translocation and back-translocation have been measured under various conditions . Alternatively, the ribosomes can randomly dissociate from the RNA, in a process called drop-off, modeled by reaction (20). The overall rate of drop-off has been measured in , from which we have inferred a per-codon rate.
Trans-translation is the process by which the ribosome is released from the RNA template after stalling, which can occur for a variety of reasons, such as the incorporation of an incorrect codon, premature mRNA degradation, or spontaneous frameshifting . Trans-translation is executed by the tmRNA that, together with SmpB and EF-Tu, binds to the A-site of the ribosome and releases it from the mRNA . Once the ribosome is released, the mRNA is degraded. In the model, stalling followed by trans-translation can occur spontaneously with a given probability at any codon via reaction (21). When this reaction occurs, the RNA strand is immediately destroyed in the simulation, and all translating ribosomes are released back into the cellular medium, denoted in reaction (21) by [RibR]Rib, where [RibR] denotes the number of ribosomes bound to the RNA at that moment.
Translation elongation continues until the STOP codon is reached (22), after which RF1 or RF2 binds and releases the ribosome together with RF3 . These are not modeled explicitly in the model. Its kinetic rate is higher than initiation, preventing queuing near the stop codon . Reaction (22) is followed by folding and activation (23), modeled as a first order process for simplicity . The rate of this reaction is set to model the maturation time of GFP, as most measurements of protein expression at the single cell level use this protein. Pprem denotes the unfolded protein, while P denotes the complete activated protein, which can then degrade via reaction (24).
Given the above, we note that the dynamics of transcription and translation are sequence dependent in the present model in the following ways. First, the model allows the insertion of, e.g., arrests or sequence specific pauses at a specific nucleotide (exemplified in the last section of the results section). In general, since the rates of all possible events are defined uniquely for each nucleotide, any event may be set to have a distinct propensity at a specific nucleotide rather than a constant rate for all nucleotides. Translation elongation is, in the same manner, sequence dependent, with the additional feature that the rates of elongation in this case are always codon dependent.
The chemical reactions and rate constants (in s -1 ) used to model translation initiation, elongation, and termination, as well as protein folding and activation and protein degradation are in Table 2. Parameter values were obtained from measurements in E. coli, mainly for LacZ.
Quantifying the correlation between protein and mRNA levels
Protein levels do not respond instantaneously to changes in the number of mRNA molecules in the system since new proteins take time to synthesize after a new mRNA is produced, and excess proteins take time to degrade after an mRNA has been degraded. Instead, the fluctuations in protein levels result from a time averaging of the fluctuations in mRNA levels . The degree to which fluctuations propagate from RNA to protein levels depends on various parameters, the most relevant being the ratio between the degradation rates of the proteins and RNAs. Changing this ratio is likely to affect the degree of correlation between the RNA and protein time series.
This work was supported by the Academy of Finland (JLP, ASR) and by the FiDiPro programme of Finnish Funding Agency for Technology and Innovation (JM, OYH, and ASR). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
- Rajala T, Häkkinen A, Healy S, Yli-Harja O, Ribeiro AS: Effects of transcriptional pausing on gene expression dynamics. PLOS Comput Biol 2010, 6(3):e1000704. 10.1371/journal.pcbi.1000704PubMed CentralView ArticlePubMedGoogle Scholar
- Greive SJ, von Hippel PH: Thinking quantitatively about transcriptional regulation. Nat Rev Mol Cell Biol 2005, 6: 221–232.View ArticlePubMedGoogle Scholar
- Wen JD, Lancaster L, Hodges C, Zeri AC, Yoshimura SH, Noller HF, Bustamante C, Tinoco I Jr: Following translation by single ribosomes one codon at a time. Nature 2008, 452: 598–603. 10.1038/nature06716PubMed CentralView ArticlePubMedGoogle Scholar
- Landick R: The regulatory roles and mechanism of transcriptional pausing. Biochem Soc Trans 2006, 34(6):1062–1066.View ArticlePubMedGoogle Scholar
- Ribeiro AS, Rajala T, Smolander OP, Häkkinen A, Yli-Harja O: Delayed Stochastic Model of Transcription at the Single Nucleotide Level. J Comput Biol 2009, 16: 539–553. 10.1089/cmb.2008.0153View ArticlePubMedGoogle Scholar
- Ribeiro AS, Häkkinen A, Mannerstrom H, Lloyd-Price J, Yli-Harja O: Effects of the promoter open complex formation on gene expression dynamics. Phys Rev E 2010, 81(1):011912.View ArticleGoogle Scholar
- Kaern M, Elston TC, Blake WJ, Collins JJ: Stochasticity in gene expression: from theories to phenotypes. Nat Rev Genet 2005, 6: 451–464. 10.1038/nrg1615View ArticlePubMedGoogle Scholar
- Pedraza J, Paulsson J: Effects of Molecular Memory and Bursting on Fluctuations in Gene Expression. Science 2008, 319: 339–334. 10.1126/science.1144331View ArticlePubMedGoogle Scholar
- Murphy KF, Balazsi G, Collins JJ: Combinatorial promoter design for engineering noisy gene expression. Proc Natl Acad Sci USA 2007, 104: 12726–12731. 10.1073/pnas.0608451104PubMed CentralView ArticlePubMedGoogle Scholar
- Mayr E: What evolution is. Basic Books, NY, USA; 2001.Google Scholar
- Lee HH, Molla MN, Cantor CR, Collins JJ: Bacterial charity work leads to population-wide resistance. Nature 2010, 467: 82–86. 10.1038/nature09354PubMed CentralView ArticlePubMedGoogle Scholar
- Acar M, Mettetal J, van Oudenaarden A: Stochastic switching as a survival strategy in fluctuating environments. Nature Genetics 2008, 40: 471–475. 10.1038/ng.110View ArticlePubMedGoogle Scholar
- Yu J, Xiao J, Ren X, Lao K, Xie XS: Probing gene expression in live cells, one protein molecule at a time. Science 2006, 311: 1600–1603. 10.1126/science.1119623View ArticlePubMedGoogle Scholar
- Golding I, Paulsson J, Zawilski SM, Cox EC: Real-time kinetics of gene activity in individual bacteria. Cell 2005, 123: 1025–1036. 10.1016/j.cell.2005.09.031View ArticlePubMedGoogle Scholar
- Ribeiro AS: Stochastic and delayed stochastic models of gene expression and regulation. Mathematical Biosciences 2010, 223(1):1–11. 10.1016/j.mbs.2009.10.007View ArticlePubMedGoogle Scholar
- Herbert KM, La Porta A, Wong BJ, Mooney RA, Neuman KC, Landick R, Block SM: Sequence-resolved detection of pausing by single RNA polymerase molecules. Cell 2006, 125: 1083–1094. 10.1016/j.cell.2006.04.032PubMed CentralView ArticlePubMedGoogle Scholar
- Sorensen MA, Pedersen S: Absolute in vivo translation rates of individual codons in Escherichia coli. J Mol Biol 1991, 222: 265–280. 10.1016/0022-2836(91)90211-NView ArticlePubMedGoogle Scholar
- Bernstein J, Khodursky A, Lin P, Lin-Chao S, Cohen S: Global analysis of mRNA decay and abundance in Escherichia coli at single-gene resolution using two-color fluorescent DNA microarrays. Proc Natl Acad Sci USA 2002, 99: 9697–9702. 10.1073/pnas.112318199PubMed CentralView ArticlePubMedGoogle Scholar
- Roussel MR, Zhu R: Validation of an algorithm for delay stochastic simulation of transcription and translation in prokaryotic gene expression. Phys Biol 2006, 3: 274–284. 10.1088/1478-3975/3/4/005View ArticlePubMedGoogle Scholar
- Mitarai N, Sneppen K, Pedersen S: Ribosome collisions and translation efficiency: optimization by codon usage and mRNA destabilization. J Mol Biol 2008, 382(1):236–245. 10.1016/j.jmb.2008.06.068View ArticlePubMedGoogle Scholar
- Zhu R, Ribeiro AS, Salahub D, Kauffman SA: Studying genetic regulatory networks at the molecular level: delayed reaction stochastic models. J Theor Biol 2007, 246: 725–745. 10.1016/j.jtbi.2007.01.021View ArticlePubMedGoogle Scholar
- Voliotis M, Cohen N, Molina-Paris C, Liverpool TB: Fluctuations, pauses and backtracking in DNA transcription. Biophys J 2008, 94: 334–348.PubMed CentralView ArticlePubMedGoogle Scholar
- Ribeiro AS, Zhu R, Kauffman SA: A general modeling strategy for gene regulatory networks with stochastic dynamics. J Comput Biol 2006, 13: 1630–1639. 10.1089/cmb.2006.13.1630View ArticlePubMedGoogle Scholar
- Ribeiro AS, Lloyd-Price J: SGN Sim, a Stochastic Genetic Networks Simulator. Bioinformatics 2007, 23(6):777–779. 10.1093/bioinformatics/btm004View ArticlePubMedGoogle Scholar
- Lutz R, Lozinski T, Ellinger T, Bujard H: Dissecting the functional program of Escherichia coli promoters: the combined mode of action of Lac repressor and AraC activator. Nuc Ac Res 2001, 29: 3873–3881. 10.1093/nar/29.18.3873View ArticleGoogle Scholar
- Gillespie DT: Exact stochastic simulation of coupled chemical reactions. J Phys Chem 1977, 81: 2340–2361. 10.1021/j100540a008View ArticleGoogle Scholar
- Arkin A, Ross J, McAdams H: Stochastic kinetic analysis of developmental pathway bifurcation in phage λ-infected E. coli cells. Genetics 1998, 149: 1633–1648.PubMed CentralPubMedGoogle Scholar
- Yarchuk O, Jacques N, Guillerez J, Dreyfus M: Interdependence of translation, transcription and mRNA degradation in the lacZ gene. J Mol Biol 1992, 226: 581–596. 10.1016/0022-2836(92)90617-SView ArticlePubMedGoogle Scholar
- Paulsson J: Models of stochastic gene expression. Phys Life Rev 2005, 2(2):157–175. 10.1016/j.plrev.2005.03.003View ArticleGoogle Scholar
- Shaevitz JW, Abbondanzieri EA, Landick R, Block SM: Backtracking by single RNA polymerase molecules observed at near-base-pair resolution. Nature 2003, 426: 684–687. 10.1038/nature02191PubMed CentralView ArticlePubMedGoogle Scholar
- Landick R: Transcriptional pausing without backtracking. Proc Natl Acad Sci USA 2009, 106(22):8797–8798. 10.1073/pnas.0904373106PubMed CentralView ArticlePubMedGoogle Scholar
- Ribeiro AS, Häkkinen A, Healy S, Yli-Harja O: Dynamical effects of transcriptional pause-prone sites. Comput Biol Chem 2010, 34(3):143–148. 10.1016/j.compbiolchem.2010.04.003View ArticlePubMedGoogle Scholar
- Choi PJ, Cai L, Frieda K, Xie XS: A Stochastic Single - Molecule Event Triggers Phenotype Switching of a Bacterial Cell. Science 2008, 322(5900):442–446. 10.1126/science.1161427PubMed CentralView ArticlePubMedGoogle Scholar
- Xie XS, Choi PJ, Li GW, Lee NK, Lia G: Single-molecule approach to molecular biology in living bacterial cells. Annu Rev Biophys 2008, 37: 417–444. 10.1146/annurev.biophys.37.092607.174640View ArticlePubMedGoogle Scholar
- Burmann BM, Schweimer K, Luo X, Wahl MC, Stitt BL, Gottesman ME, Rösch P: A NusE:NusG Complex Links Transcription and Translation. Science 2010, 328(5977):501–504. 10.1126/science.1184953View ArticlePubMedGoogle Scholar
- Ota K, Yamada T, Yamanishi Y, Goto S, Kanehisa M: Comprehensive Analysis of Delay in Transcriptional Regulation Using Expression Profiles. Genome Informatics 2003, 14: 302–303.Google Scholar
- Phroskin S, Rachid Rahmouni A, Mironov A, Nudler E: Cooperation between translating ribosomes and RNA polymerase in transcription elongation. Science 2010, 328(5977):504–508. 10.1126/science.1184939View ArticleGoogle Scholar
- Epshtein V, Nudler E: Cooperation between RNA polymerase molecules in transcription elongation. Science 2003, 300(5620):801–805. 10.1126/science.1083219View ArticlePubMedGoogle Scholar
- Lewin B: Genes IX. Jones and Bartlett Publishers, USA; 2008:256–299.Google Scholar
- Erie DA, Hajiseyedjavadi O, Young MC, von Hippel PH: Multiple RNA polymerase conformations and GreA: control of the fidelity of transcription. Science 1993, 262: 867–873. 10.1126/science.8235608View ArticlePubMedGoogle Scholar
- Greive SJ, Weitzel SE, Goodarzi JP, Main LJ, Pasman Z, von Hippel PH: Monitoring RNA transcription in real time by using surface plasmon resonance. Proc Natl Acad Sci USA 2008, 105: 3315–3320. 10.1073/pnas.0712074105PubMed CentralView ArticlePubMedGoogle Scholar
- McClure WR: Rate-limiting steps in RNA chain initiation. Proc Natl Acad Sci USA 1980, 77: 5634–5638. 10.1073/pnas.77.10.5634PubMed CentralView ArticlePubMedGoogle Scholar
- Balesco JG: All things must pass: Contrasts and commonalities in eukaryotic and bacterial mRNA decay. Nat Rev Mol Cell Biol 2010, 11(7):467–478. 10.1038/nrm2917View ArticleGoogle Scholar
- Hsu LM: Promoter clearance and escape in prokaryotes. Biochimica et Biophysica Acta - Gene Structure and Expression 2002, 1577(2):191–207. 10.1016/S0167-4781(02)00452-9View ArticleGoogle Scholar
- Jorgensen F, Kurland CG: Processivity errors of gene expression in Escherichia coli. J Mol Biol 1990, 215: 511–521. 10.1016/S0022-2836(05)80164-0View ArticlePubMedGoogle Scholar
- Moore SD, Sauer RT: Ribosome rescue: tmRNA tagging activity and capacity in Escherichia coli. Mol Microbiol 2005, 58: 456–466. 10.1111/j.1365-2958.2005.04832.xView ArticlePubMedGoogle Scholar
- Cormack BP, Valdivia RH, Falkow S: FACS-optimized mutants of the green fluorescent protein (GFP). Gene 1996, 173(1):33–38. 10.1016/0378-1119(95)00685-0View ArticlePubMedGoogle Scholar
- Alberts B, Johnson A, Lewis J, Raff M, Roberts K, Walter P: Molecular biology of the cell. Garland Science, USA; 2002.Google Scholar
- Sorensen MA, Kurland CG, Pedersen S: Codon usage determines translation rate in Escherichia coli. J Mol Biol 1989, 207: 365–377. 10.1016/0022-2836(89)90260-XView ArticlePubMedGoogle Scholar
- Menninger JR: Peptidyl transfer RNA dissociates during protein synthesis from ribosomes of Escherichia coli. J Biol Chem 1976, 251: 3392–3398.PubMedGoogle Scholar
- Shoji S, Walker SE, Fredrick K: Ribosomal translocation: One step closer to the molecular mechanism. ACS Chem Biol 2009, 4: 93–107. 10.1021/cb8002946PubMed CentralView ArticlePubMedGoogle Scholar
- Qin Y, Polacek N, Vesper O, Staub E, Einfeldt E, Wilson DN, Nierhaus KH: The highly conserved LepA is a ribosomal elongation factor that back-translocates the ribosome. Cell 2006, 127: 721–733. 10.1016/j.cell.2006.09.037View ArticlePubMedGoogle Scholar
- Keiler KC: Biology of trans-translation. Annu Rev Microbiol 2008, 62: 133–151. 10.1146/annurev.micro.62.081307.162948View ArticlePubMedGoogle Scholar
- Bracewell R: Pentagram Notation for Cross Correlation. The Fourier Transform and Its Applications. New York: McGraw-Hill; 1965:46–243.Google Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.