Skip to main content

A computational pipeline for identifying kinetic motifs to aid in the design and improvement of synthetic gene circuits

Abstract

Background

An increasing number of genetic components are available in several depositories of such components to facilitate synthetic biology research, but picking out those that will allow a designed circuit to achieve the specified function still requires multiple cycles of testing. Here, we addressed this problem by developing a computational pipeline to mathematically simulate a gene circuit for a comprehensive range and combination of the kinetic parameters of the biological components that constitute the gene circuit.

Results

We showed that, using a well-studied transcriptional repression cascade as an example, the sets of kinetic parameters that could produce the specified system dynamics of the gene circuit formed clusters of recurrent combinations, referred to as kinetic motifs, which appear to be associated with both the specific topology and specified dynamics of the circuit. Furthermore, the use of the resulting "handbook" of performance-ranked kinetic motifs in finding suitable circuit components was illustrated in two application scenarios.

Conclusions

These results show that the computational pipeline developed here can provide a rational-based guide to aid in the design and improvement of synthetic gene circuits.

Background

The goal of synthetic biology is to be able to engineer biological processes and to select and put together standardized components according to a design and user-specified function and dynamics [15]. To this end, several depositories of biological components have been established [612], but the design cycles still rely very heavily on the slow and error-prone process of trying out the parts [13, 14].

A fundamental problem is that we still lack clear knowledge of the factors that govern the dynamic behaviours of even the very simple circuits that are motifs of large biological networks [1318]. Although the structure, or topology, of a biological network may largely dictate its dynamics [1921], the kinetic parameters (e.g. those that indicate the level of efficiency) of the involved biochemical reactions also play a role [2227]. Thus, when performing a mathematical simulation to determine which biological components from the depositories should be chosen for a synthetically designed biological circuit with a specified dynamics, one needs to consider not only network topology, but also kinetic parameters.

In this work, we developed a computational pipeline, called Kinetic Motif and Functional Analysis (KMFA), to address this problem. By identifying the set of kinetic parameters required to produce the user-specified dynamic behaviour for a given network topology in both the presence and absence of random perturbations and using statistical analysis to identify recurrent patterns (i.e. motifs) of these kinetic parameters and understand their mechanics, KMFA provides a "handbook" of kinetic motifs in which one can look up a biological components library to choose suitable parts for optimal performance of the designed circuit.

We first demonstrated the utility of KMFA by applying it to a well known synthetic gene circuit of a looped cascade of transcriptional inhibitions built in Escherichia coli [2831]. We showed that, for this 4-gene circuit, only 2,355 (0.6%) of the 390,625 (58) possible combinations of the kinetic parameters could produce the prescribed steady-state concentration of each gene product as the output of the circuit under both perturbed and unperturbed conditions and that these kinetic solutions formed clusters of motifs, which could be ranked according to their relative performance, thus yielding a "handbook" of performance-ranked kinetic motifs that can be used to select library components for the circuit. We illustrated this using two scenarios: the first was to identify faulty/sub-optimal components and replace them with suitable ones to make a non-functional circuit functional, while the second was to improve the performance of an already functional circuit. KMFA is therefore a useful computational tool with a rational design capability to choose circuit components in synthetic biology research.

Methods

As shown in Figure 1, the KMFA pipeline consists of four steps. Although the concept and procedures of KMFA are generally applicable, we used a specific gene circuit to describe these steps and, later, in the Results, illustrate its utilities. This circuit was a four-gene transcriptional cascade that has been synthesized in E coli and has been studied both experimentally [28] and computationally [2931].

Figure 1
figure 1

Steps in the kinetic motif and functional analysis (KMFA) pipeline. KMFA is a computational pipeline consisting of four steps: system setup, functional mapping by simulation, identification of functional kinetic motifs and tabulation of performance-ranked kinetic motifs. The details of each step are described in the Methods.

Step 1: System setup

In this step, the system is set up to be numerically simulated in Step 2. This includes specifying the topology of the gene circuit or network, the mathematical model of the system, the target output dynamics and performance measures. Many studies have shown that deterministic modeling and stochastic modeling of biological systems have complementary benefits, the deterministic modeling providing the mean qualitative dynamics of a gene circuit and stochastic modeling the effects of random noises on the circuit [32, 33]. In this study, we modeled the dynamics of the transcriptional cascade using both deterministic and stochastic simulations, the latter being used to find kinetic solutions for a robust system that could function properly under perturbations. The mathematical equations for the deterministic simulations are [29]:

i t = f p i , d i = p i , 0 + p i r i x j t - d i x i t r i x j , t = 1 1 + x j t 1000 2
(1)

where x i (t) denotes the concentration of protein i and i ( t ) its rate of change at time t; f(p i ,d i ) denotes the non-linear gene regulation of transcription and translation of protein i; p i,0 is the basal production rate, p i the production rate constant and d i the degradation rate constant of protein i; and r i (x j (t)) denotes the Hill function of regulator protein j repressing the production of protein i at time t.

The mathematical equations for the stochastic simulations are [31]:

i ( t ) = g ( p i , d i ) = f ( p i , d i ) + Δ f ( p i , d i ) n i ( t ) + v i ( t ) = ( p i , 0 + p i r i ( y j ( t ) ) - d i y i ( t ) ) + ( Δ p i , 0 + Δ p i r i ( y j ( t ) ) - Δ d i y i ( t ) ) n i ( t ) + v i ( t )
(2)

where y i (t) denotes the concentration of protein i and i ( t ) its rate of change at time t; g(p i ,d i ) denotes the non-linear gene regulation of transcription and translation of protein i influenced by both intrinsic and external random noises; Δp i,0 , Δp i and Δd i denote the respective standard deviations for the parameters of basal production rate, production rate and degradation rate; and n i and v i are, respectively, randomly generated intrinsic and extrinsic noises for protein i.

This cascade of transcriptional inhibitions has been shown to produce steady-state concentrations of the proteins involved [2831]. To facilitate comparison, the target dynamics (z) were chosen to be the steady-state concentrations of protein TetR, LacI, CI and Eyfp used in Chen and Wu [31], giving z = {1000, 30000, 300, 30000 nM}. To measure the performance of a given set of kinetic parameters (p i , d i ), i = {TetR, LacI, CI, Eyfp}, deviations from the steady-state concentrations of the target were computed for the deterministic simulations using Eq. (3) and for the stochastic simulations using Eq. (4), where DD denotes the deterministic deviation, DS the stochastic deviation and ln a logarithmic transformation function.

D D = D e v i a t i o n ( x , z ) = l n ( i t x i ( t ) - z i ( t ) )
(3)
D S = D e v i a t i o n ( y , z ) = l n ( i t y i ( t ) - z i ( t ) )
(4)

In order to compare with the numerical results of [31], the two measures were not normalized even though z varies from 300 to 30000 nM. However, by dictating that a viable dynamics must not deviate from the target concentration by more than 20% for each of the four proteins (see below), we reduced the possibility of the contribution from CI, which has the smallest target concentration at 300 nM, being overtaken by those of the other proteins.

Step 2: Simulation

For our simulations, we employed the same range of parameter values and values for the basal production rate constants, initial protein concentrations and perturbation fluctuations used by Chen and Wu [31] (summarized in Additional file 1: Table S1). Because it is impossible to enumerate all the real values of the kinetic parameters, we uniformly binned each kinetic parameter into five different rate efficiency levels, with level 1 denoting the lowest and level 5 the highest strength (efficiency) of the kinetics, and used the mean of each bin as the representative of the bin (Additional file 1: Table S2) for the simulations. The five levels may respectively correspond to "weakest", "weak", "medium", "strong", and "strongest" categories that are often used in experimental studies to characterize, say, binding efficiency of ribosome binding sites [7]. Increasing the number of levels will increase the resolution on the transformation of parameter values but will also increase the computational cost and difficulties to identify kinetic motifs in subsequent analysis (see below). Since there are four genes in the system and each gene product (protein) is associated with two kinetic parameters, one for production and the other for degradation, there were a total of 390,625 (58) sets of kinetic parameters that could be used for simulations. For each of these parameter sets, Eq. (1) (deterministic) and Eq. (2) (stochastic) were simulated for a maximum of 100 time steps and differences in protein concentrations between the simulation and prescribed values (z) (Eq. (3) and Eq (4)) calculated. At any time point, if the concentration of any of the four proteins exceeded 106 nM, the simulation (Eq. (1) or Eq. (2)) was aborted prematurely and an extremely large deviation value of 15 was assigned. The other ill-behaved parameter sets, for which a value of 15 was assigned to their DD (Eq. (3)) or DS (Eq. (4)), were those in which at least one of the four proteins exhibited a steady-state concentration, computed as the mean for the second half of the simulation, that deviated by more than 20% from the prescribed value. Simulations with a deviation value of 15 were considered non-functional.

Step 3: Identification of kinetic motifs

All the sets of kinetic parameters that were not ill-behaved (i.e. both their DD and DS values were smaller than 15) were transformed into bin integers representing efficiency levels (see above), which were then clustered hierarchically [34] based on the similarity, computed by the Hamming distance [35], of the sequence of bin levels. This resulted in clusters of kinetic parameters, and each cluster, called a kinetic motif, could be represented by a consensus sequence logo, such as {(1, 1); (3, 3); (*, *); (3, 3)}, in which bin levels in integers of the two kinetic parameters for the production and degradation of each of the four proteins are paired in parentheses and * indicates any levels.

Step 4: Generation of a handbook of kinetic motifs

These motifs could be ranked based on their performance, i.e. how well they could produce the specified system dynamics, as measured by the deviations DD (Eq. (3)) and DS (Eq. (4)). This resulted in a table, or handbook, of performance-ranked kinetic parameter sets, which provides information that can be easily referred to in order to identify suitable genetic components to assemble a functional circuit that meets user-desired specifications.

Results

Mapping parameters to solutions

As described in the Methods, we comprehensively searched the parameter space, albeit using representatives of uniformly divided bins, for sets of kinetics parameters that could produce the specified steady-state system dynamics under both conditions of with and without perturbations. Figure 2A shows that only a very small fraction (2,355 or 0.6%) of the parameter sets sampled (58 = 390,625) could produce the specified dynamics, while the vast majority (387,535 or 99.2%) failed to function properly under either condition. Interestingly, some parameter sets (0.1%) functioned well only without perturbations, while others (0.1%) did so only with perturbations.

Figure 2
figure 2

Identification and clustering of functional kinetic motifs. (A) The solution map, in which each black dot represents the simulation result, as measured by deviations DD and DS, for one parameter set and the large black dot in the upper right corner represents many simulation results, because the values of their DD and DS were both set at 15 to indicate ill-behaved dynamics (see Methods). The resulting dynamics from both the deterministic (i.e. without perturbation) and stochastic (i.e. with perturbation) simulation for the simulated 4-gene transcriptional repression network fell into one of four groups. The vast majority (99.2%) failed to produce the specified dynamics, i.e. the specified steady-state concentration of the protein product of each of the four genes, in either the deterministic or the stochastic simulation, while a very low percentage succeeded in one, but not both, of the two types of simulation (0.1% for each). Those that succeeded in both also accounted for a very small percentage (0.6%) of the kinetic parameters simulated. A typical dynamics is shown for each of the four groups as insets; that on the left is deterministic and that on the right stochastic. (B) Clusters of kinetic motifs. The functional sets (2,355 sets or 0.6% of those sampled) were hierarchically clustered based on the integer sequence of their kinetic efficiency levels, which are colour-coded according to the spectrum shown to the far right, and each cluster could be represented by a motif logo (see Methods).

Clusters of kinetic motifs

Each of the 2,355 kinetic solutions obtained in Figure 2A is a combination of 8 kinetic parameters consisting of the pair of kinetic parameters p, for the production rate, and d, for the degradation rate of each of the four proteins. The resulting 2,355 8-integer sequences were compared and grouped into 52 clusters (Figure 2B), 48 of which had at least 10 members (i.e. 10 parameter sets that could produce the specified steady-state dynamics). Similar to sequences of DNA [36] or amino acids [3739], these clusters of kinetic parameter sets, or kinetic motifs, were represented by a motif logo made up of efficiency level integers in which the character size reflects the extent of consensus within the cluster, while '*' indicates no consensus at all (Figure 2B). These 48 clusters were tabulated in a handbook of kinetic motifs (Additional file 1: Table S3), along with their circuit structure, kinetic motif logo, number of cluster members and performance scores.

Functional association of kinetic motifs

From Additional file 1: Table S3, in which the clusters of kinetic motifs are ranked by their performance in decreasing value of the sum of their DD and DS scores, we observed that, on the whole, the top clusters tended to have high, though not necessarily the highest, production and degradation rates, i.e. there were more 3's or 4's and 5's than 1's and 2's in the motif logos, and that, conversely, levels 1 and 2 occurred more frequently in the motif logos for the clusters ranked at the bottom. This was particularly true for the 2nd (LacI) and 4th (Eyfp) proteins in the network, because they were required to have a high steady-state output concentration of 30,000 nM, compared to 1,000 nM for the 1st protein (TetR) and 300 nM for the 3rd (CI) (see Methods). Furthermore, for the same protein, its production and degradation rates seemed to be symmetry-related, i.e. level 3 degradation tended to be paired with level 3 production. Such symmetry is also visible in Figure 2B. These observations are consistent with our understanding of the dynamic behaviours of a transcription regulation system in that, in general, degradation rates determine the response time and fast degradation rates allow rapid changes in protein concentration and can, therefore, minimize the response time to stimulation [40, 41]. Furthermore, to maintain a given steady state concentration, proteins with a rapid degradation rate also require a high production rate, the final concentration being determined by both rates [16, 42]. Inspection of individual solutions with the highest degradation rate (level 5) showed that they tended to exhibit high fluctuations in protein concentrations before reaching the final steady-state concentrations, suggesting that the highest degradation rate (level 5) may not always be the most desirable for the designed circuit, which may also explain why the symmetry between protein production and degradation tended to break down a little when levels were very high (4 or 5; Additional file 1: Table S3).

Another interesting observation was that the third protein (CI) did not show a preference for a particular efficiency level for either of its two kinetic parameters (Figure 2B and Additional file 1: Table S3). As shown in Additional file 1: Fig. S1, this can be explained by examining the repression network and its rate equations. Briefly, the high initial and steady-state concentrations of protein LacI, which represses the cI gene, renders the contribution of gene regulation to CI production negligible and, thus, the magnitude of its production kinetic parameter, pCI, inconsequential; also, as a consequence, at the steady state, all the values within the range allowed for its degradation kinetic parameter, dCI, would lead to a CI concentration within 20% of the specified value, thus meeting the required condition for a functional solution (see Methods).

Application scenario I: making a non-functional design functional

Given a non-functional circuit, it is, at present, not easy to determine the reasons for the failure to function or to identify the faulty/sub-optimal component that needs to be replaced or corrected to salvage the design. The handbook of kinetic motifs and associated properties (Additional file 1: Table S3) is a great aid to solving this problem, as illustrated by the following example. As shown in Figure 3A, the original network was composed of components with kinetic parameters {(pTetR, dTetR); (pLacI, dLacI); (pCI, dCI); (pEyfp, dEyfp)} with values equivalent to efficiency levels of {(3, 3); (5, 1); (2, 4); (2, 1)} that could not produce the specified dynamics for the 2nd (LacI) and 4th (Eytp) proteins. Comparing this sequence of kinetic levels with those tabulated in Additional file 1: Table S3, we found two motifs with a sequence that differed at only two of the 8 positions (i.e. Hamming distance = 2), namely motif No. 47 [{(3, 3); (1, 1); (*, *); (1, 1)}] and motif No. 12 [{(3, 3); (5, 5); (*, *); (2, 2)}]. Note that, in this comparison, we ignored the 3rd protein (CI) because, as mentioned above, it does not have a preference for these kinetic levels. To modify the original design into one of these two motifs, we can change the efficiency level of the kinetic parameters by using a different ribosome binding site (RBS) for one or more of the genes and thereby changing their protein production level or by using a different protein degradation tag (PDT) to alter the protein degradation rate. Specifically, we can either replace the RBS in the original design with one with a protein production efficiency level of 1 for the 2nd and 4th proteins to change the original design into one that would behave like motif No. 47 or we can replace the PDT in the original design with one with protein degradation efficiency levels of 5 for the 2nd protein and 2 for the 4th protein, making it resemble motif No. 12. Both routes of modification would render the system functional, although the second would lead to a system with a better performance. Note that the key rule in making these changes is to maintain symmetry between the protein production and degradation rates for the same protein, as discussed above.

Figure 3
figure 3

Two scenarios demonstrating the application of KMFA. (A) Scenario I: Making a non-functional design functional. If we have a non-functional original design, as indicated by the dynamics of the deterministic and stochastic simulation shown, we can transform its kinetic parameters into efficiency levels and look in the handbook of kinetic motifs (Additional file 1: Table S3) for a similar, but functional, design to replace some of the ribosome binding sites (RBS, denoted as R and represented by triangles) and/or protein degradation tags (PDT, denoted as T and represented by hexagons) with others with a suitable efficiency level to render the modified design functional. In this example, two different routes of minimal changes (Hamming distance (D) of replacement = 2) were found. (B) Scenario II: Improving a functional design. In this example, the original, already functional, design is that reported by Chen and Wu [31], the ranking of which in our performance table (Additional file 1: Table S3) can be improved by selecting components with an RBS and/or PTD that can work with other components to generate a better system performance (i.e. better performance ranking in Additional file 1: Table S3). All notations and symbols are the same as those in (A).

Application scenario II: improving a functional design

In the second scenario (Figure 3B), the original circuit was already functional and, in fact, has been shown to be robust under perturbations by Chen and Wu [31]. However, it was ranked in the middle part of Additional file 1: Table S3 (motif No. 20) and therefore could still be improved. There is more than one way of improving the performance of the original design. The two proposed routes of change illustrated in Figure 3B would both lead to a superior performance ranking in the "handbook" of this circuit: the first involves increasing the efficiency level for both the RBS and PDT for the 2nd protein (LacI), while the second requires the two steps of replacing the RBS and PDT for the 1st protein (TetR), then those for the 2nd protein (LacI).

Discussion

Mathematical modelling and simulations are being increasingly used to help design simple biological circuits to achieve user-specified functions or certain network behaviours in synthetic biology [5, 4347]. Much work has been focused on finding suitable network topologies [1921], or kinetic parameters that can produce the desired system dynamics, given a network topology [2226, 2931]. In this work, we have developed a computational pipeline, KMFA, for the latter. Compared to previous studies, a distinctive feature of KMFA is that, by comprehensively mapping the parameter space and functional space (Figure 1, Step 2), the identified multiple sets of functional kinetic parameters can be clustered and their motifs (i.e. their recurring combinations) identified (Figure 2B) and analysed to reveal and understand significant functional associations (see Results, Figure 2).

In this work, the network was simulated with and without perturbations, the former being modeled by a stochastic process with randomly introduced intrinsic and extrinsic noises (Eq. (2)) and the latter by a deterministic process (Eq. (1)). The results showed that there are certain situations (i.e. certain combinations of the kinetic parameters) in which the system may function well under perturbations, but not in the absence of perturbations. Although it is well recognized that noise is an integral part of normal biological functions and how biological systems evolve to be robust [4853], our results suggest that, to design a 'truly robust' biological system, both conditions (with and without noise) need to be considered. Although further analysis is required to elucidate the mechanism for the effect of perturbation observed in this study, we speculate that perturbation alters the "background" protein production and degradation rates to allow the circuit to achieve and maintain target dynamics. Indeed, several studies have shown that gene circuits (e.g., switches and oscillations) rely on noise to achieve desired functions and would fail without them [5457].

As a proof-of-principle study, KMFA has been designed in the present work to exhaust all possible kinetic parameter combinations, which limits its application to very small systems such as the transcriptional cascade serving as a demonstrating example above. However, this limitation can be lifted by integrating with approaches such as Monte-Carlo sampling [58], Latin hypercube sampling [59], and others [29, 60] that have been developed to efficiently search for parameter solutions from high dimensional spaces. Indeed, several large systems (e.g. the Fas apoptotic pathway [61]) have been studied using some of these methods. Replacing our time-consuming simulation step (see Figure 1) with these efficient parameter-searching methods, one can collect the parameter solutions for an interesting function/dynamics of a large system, and then apply KMFA on those parameter solutions to find kinetic motifs.

An important finding of this study is that, as in studies of network topologies [1921], only a very small percentage of kinetic parameters can be functional, i.e. produce the desired system dynamics (Figure 2A). Obviously, network topology and kinetic parameters are intertwined, and the kinetic motifs (Additional file 1: Table S3) identified here will probably be specific not only for the specific arrangement of the components of the circuit, but also for the specified output protein concentrations. Nevertheless, for a given circuit design, the "handbook" of kinetic motifs (Additional file 1: Table S3) will be very useful in deciding which biological components available from a biological parts library, such as BioFab [10] and the MIT Registry of Standard Biological Parts [7], should be used, as illustrated by the two application scenarios presented in Figure 3. Thus, the handbook could be used to standardize the biological parts (e.g. by classifying them into 5 efficiency levels) and, via ranking all possible combinations of the parts, standardize the circuit design process. The handbook could furthermore be used to help synthesize required parts of a specified efficiency level not yet collected in the library, especially the ribosome binding sites for which a relationship between binding efficiency and binding site sequence has been elucidated [62]. As more and more genetic components (e.g. promoters, ribosome binding sites, terminators and protein degradation tags) are deposited and characterized, mathematical simulations using a computational pipeline, such as KMFA, will make the synthetic design and engineering of biological circuits more efficient and also more rationale-based. Furthermore, the general applicability of the KMFA approach will allow simulations of different network topologies and different functions (see Additional file 1 for additional illustration on an 'AND'-gate circuit [63, 64]) in comparative studies to uncover intriguing common/distinct principles of biological networks, particularly those pertaining to kinetic parameters, which have received less attention than network topologies.

Conclusions

Knowledge about how to choose suitable components for a designed gene circuit is required for efficient research in synthetic biology. For a given topology of a designed gene circuit, the computational pipeline, KMFA, developed here, has produced a "handbook" of performance-ranked kinetic motifs that can serve as a user guide to allow the selection and matching up of different genetic components to achieve user-specified system functions. In addition to being a useful aid for the synthetic design of biological circuits, KMFA can also be used to elucidate the intertwined relationship of the trinity of prototype biological circuits: topology, kinetics of the parts and function.

References

  1. Endy D: Foundations for engineering biology. Nature. 2005, 438 (7067): 449-453. 10.1038/nature04342.

    Article  CAS  PubMed  Google Scholar 

  2. Guido NJ, Wang X, Adalsteinsson D, McMillen D, Hasty J, Cantor CR, Elston TC, Collins JJ: A bottom-up approach to gene regulation. Nature. 2006, 439 (7078): 856-860. 10.1038/nature04473.

    Article  CAS  PubMed  Google Scholar 

  3. Canton B, Labno A, Endy D: Refinement and standardization of synthetic biological parts and devices. Nat Biotechnol. 2008, 26 (7): 787-793. 10.1038/nbt1413.

    Article  CAS  PubMed  Google Scholar 

  4. Ellis T, Wang X, Collins JJ: Diversity-based, model-guided construction of synthetic gene networks with predicted functions. Nat Biotechnol. 2009, 27 (5): 465-471. 10.1038/nbt.1536.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  5. Slusarczyk AL, Lin A, Weiss R: Foundations for the design and implementation of synthetic genetic circuits. Nat Rev Genet. 2012, 13 (6): 406-420. 10.1038/nrg3227.

    Article  CAS  PubMed  Google Scholar 

  6. Edelman GM, Meech R, Owens GC, Jones FS: Synthetic promoter elements obtained by nucleotide sequence variation and selection for activity. Proc Natl Acad Sci USA. 2000, 97 (7): 3038-3043. 10.1073/pnas.97.7.3038.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  7. MIT Registry of Standard Biological Parts. [http://partsregistry.org]

  8. BioBricks Foundation. [http://bbf.openwetware.org/]

  9. Murphy KF, Balazsi G, Collins JJ: Combinatorial promoter design for engineering noisy gene expression. Proc Natl Acad Sci USA. 2007, 104 (31): 12726-12731. 10.1073/pnas.0608451104.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  10. BioFab: International Open Facility Advancing Biotechnology. [http://www.biofab.org/]

  11. Constante M, Grunberg R, Isalan M: A biobrick library for cloning custom eukaryotic plasmids. PloS one. 2011, 6 (8): e23685-10.1371/journal.pone.0023685.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  12. Galdzicki M, Rodriguez C, Chandran D, Sauro HM, Gennari JH: Standard biological parts knowledgebase. PloS one. 2011, 6 (2): e17005-10.1371/journal.pone.0017005.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  13. Lu TK, Khalil AS, Collins JJ: Next-generation synthetic gene networks. Nat Biotechnol. 2009, 27 (12): 1139-1150. 10.1038/nbt.1591.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  14. Porcar M, Danchin A, de Lorenzo V, Dos Santos VA, Krasnogor N, Rasmussen S, Moya A: The ten grand challenges of synthetic life. Syst Synth Biol. 2011, 5: (1-2):1-9. 10.1007/s11693-011-9084-5.

    Article  PubMed Central  PubMed  Google Scholar 

  15. Milo R, Shen-Orr S, Itzkovitz S, Kashtan N, Chklovskii D, Alon U: Network motifs: simple building blocks of complex networks. Science. 2002, 298 (5594): 824-827. 10.1126/science.298.5594.824.

    Article  CAS  PubMed  Google Scholar 

  16. Alon U: An introduction to systems biology: design principles of biological circuits. 2007, Boca Raton, FL: Chapman & Hall/CRC

    Google Scholar 

  17. Shinar G, Feinberg M: Structural sources of robustness in biochemical reaction networks. Science. 2010, 327 (5971): 1389-1391. 10.1126/science.1183372.

    Article  CAS  PubMed  Google Scholar 

  18. Tyson JJ, Novak B: Functional motifs in biochemical reaction networks. Annu Rev Phys Chem. 2010, 61: 219-240. 10.1146/annurev.physchem.012809.103457.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  19. Brandman O, Meyer T: Feedback loops shape cellular signals in space and time. Science. 2008, 322 (5900): 390-395. 10.1126/science.1160617.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  20. Ma W, Trusina A, El-Samad H, Lim WA, Tang C: Defining network topologies that can achieve biochemical adaptation. Cell. 2009, 138 (4): 760-773. 10.1016/j.cell.2009.06.013.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  21. Cotterell J, Sharpe J: An atlas of gene regulatory networks reveals multiple three-gene mechanisms for interpreting morphogen gradients. Mol Syst Biol. 2010, 6: 425-

    Article  PubMed Central  PubMed  Google Scholar 

  22. de Jong H: Modeling and simulation of genetic regulatory systems: a literature review. Journal of computational biology: a journal of computational molecular cell biology. 2002, 9 (1): 67-103. 10.1089/10665270252833208.

    Article  CAS  Google Scholar 

  23. Li ZF, Osborne MR, Prvan T: Parameter estimation of ordinary differential equations. Ima J Numer Anal. 2005, 25 (2): 264-285. 10.1093/imanum/drh016.

    Article  Google Scholar 

  24. Ingram PJ, Stumpf MP, Stark J: Network motifs: structure does not determine function. BMC genomics. 2006, 7: 108-10.1186/1471-2164-7-108.

    Article  PubMed Central  PubMed  Google Scholar 

  25. Szallasi Z, Stelling J, Periwal V: System modeling in cell biology: from concepts to nuts and bolts. 2006, Cambridge, Mass.: MIT Press

    Chapter  Google Scholar 

  26. Chou IC, Voit EO: Recent developments in parameter estimation and structure identification of biochemical and genomic systems. Mathematical biosciences. 2009, 219 (2): 57-83. 10.1016/j.mbs.2009.03.002.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  27. Palsson B: Systems biology: simulation of dynamic network states. 2011, Cambridge, UK; New York: Cambridge University Press

    Chapter  Google Scholar 

  28. Hooshangi S, Thiberge S, Weiss R: Ultrasensitivity and noise propagation in a synthetic transcriptional cascade. Proc Natl Acad Sci USA. 2005, 102 (10): 3581-3586. 10.1073/pnas.0408507102.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  29. Batt G, Yordanov B, Weiss R, Belta C: Robustness analysis and tuning of synthetic gene networks. Bioinformatics. 2007, 23 (18): 2415-2422. 10.1093/bioinformatics/btm362.

    Article  CAS  PubMed  Google Scholar 

  30. Chen BS, Chang CH, Lee HC: Robust synthetic biology design: stochastic game theory approach. Bioinformatics. 2009, 25 (14): 1822-1830. 10.1093/bioinformatics/btp310.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  31. Chen BS, Wu CH: A systematic design method for robust synthetic biology to satisfy design specifications. BMC Syst Biol. 2009, 3: 66-10.1186/1752-0509-3-66.

    Article  PubMed Central  PubMed  Google Scholar 

  32. Scott M, Hwa T, Ingalls B: Deterministic characterization of stochastic genetic circuits. Proc Natl Acad Sci USA. 2007, 104 (18): 7402-7407. 10.1073/pnas.0610468104.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  33. Twycross J, Band LR, Bennett MJ, King JR, Krasnogor N: Stochastic and deterministic multiscale models for systems biology: an auxin-transport case study. BMC Syst Biol. 2010, 4: 34-10.1186/1752-0509-4-34.

    Article  PubMed Central  PubMed  Google Scholar 

  34. Eisen MB, Spellman PT, Brown PO, Botstein D: Cluster analysis and display of genome-wide expression patterns. Proceedings of the National Academy of Sciences of the United States of America. 1998, 95 (25): 14863-14868. 10.1073/pnas.95.25.14863.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  35. Hamming RW: Error Detecting and Error Correcting Codes. At&T Tech J. 1950, 29 (2): 147-160.

    Google Scholar 

  36. D'haeseleer P: How does DNA sequence motif discovery work?. Nat Biotechnol. 2006, 24 (8): 959-961. 10.1038/nbt0806-959.

    Article  PubMed  Google Scholar 

  37. Schneider TD, Stephens RM: Sequence logos: a new way to display consensus sequences. Nucleic Acids Res. 1990, 18 (20): 6097-6100. 10.1093/nar/18.20.6097.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  38. Bork P, Koonin EV: Protein sequence motifs. Curr Opin Struct Biol. 1996, 6 (3): 366-376. 10.1016/S0959-440X(96)80057-1.

    Article  CAS  PubMed  Google Scholar 

  39. Bailey TL, Williams N, Misleh C, Li WW: MEME: discovering and analyzing DNA and protein sequence motifs. Nucleic Acids Res. 2006, 34 (Web Server): W369-373. 10.1093/nar/gkl198.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  40. Rosenfeld N, Alon U: Response delays and the structure of transcription networks. J Mol Biol. 2003, 329 (4): 645-654. 10.1016/S0022-2836(03)00506-0.

    Article  CAS  PubMed  Google Scholar 

  41. Sayut DJ, Sun L: Slow activator degradation reduces the robustness of a coupled feedback loop oscillator. Mol Biosyst. 2010, 6 (8): 1469-1474. 10.1039/c003480k.

    Article  CAS  PubMed  Google Scholar 

  42. Schimke RT, Doyle D: Control of enzyme levels in animal tissues. Annu Rev Biochem. 1970, 39: 929-976. 10.1146/annurev.bi.39.070170.004433.

    Article  CAS  PubMed  Google Scholar 

  43. Endy D, Brent R: Modelling cellular behaviour. Nature. 2001, 409 (6818): 391-395. 10.1038/35053181.

    Article  CAS  PubMed  Google Scholar 

  44. Kaznessis YN: Models for synthetic biology. BMC systems biology. 2007, 1: 47-10.1186/1752-0509-1-47.

    Article  PubMed Central  PubMed  Google Scholar 

  45. Zheng Y, Sriram G: Mathematical modeling: bridging the gap between concept and realization in synthetic biology. Journal of biomedicine & biotechnology. 2010, 2010: 541609-

    Article  Google Scholar 

  46. Liang J, Luo Y, Zhao H: Synthetic biology: putting synthesis into biology. Wiley interdisciplinary reviews Systems biology and medicine. 2011, 3 (1): 7-20. 10.1002/wsbm.104.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  47. MacDonald JT, Barnes C, Kitney RI, Freemont PS, Stan GB: Computational design approaches and tools for synthetic biology. Integrative biology: quantitative biosciences from nano to macro. 2011, 3 (2): 97-108. 10.1039/c0ib00077a.

    Article  Google Scholar 

  48. Gonze D, Halloy J, Goldbeter A: Robustness of circadian rhythms with respect to molecular noise. Proc Natl Acad Sci USA. 2002, 99 (2): 673-678. 10.1073/pnas.022628299.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  49. Kitano H: Biological robustness. Nat Rev Genet. 2004, 5 (11): 826-837. 10.1038/nrg1471.

    Article  CAS  PubMed  Google Scholar 

  50. Ciliberti S, Martin OC, Wagner A: Robustness can evolve gradually in complex regulatory gene networks with varying topology. PLoS Comput Biol. 2007, 3 (2): e15-10.1371/journal.pcbi.0030015.

    Article  PubMed Central  PubMed  Google Scholar 

  51. Lapidus S, Han B, Wang J: Intrinsic noise, dissipation cost, and robustness of cellular networks: the underlying energy landscape of MAPK signal transduction. Proc Natl Acad Sci USA. 2008, 105 (16): 6039-6044. 10.1073/pnas.0708708105.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  52. Lander AD: Pattern, growth, and control. Cell. 2011, 144 (6): 955-969. 10.1016/j.cell.2011.03.009.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  53. Munsky B, Neuert G, van Oudenaarden A: Using gene expression noise to understand gene regulation. Science. 2012, 336 (6078): 183-187. 10.1126/science.1216379.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  54. Hasty J, Pradines J, Dolnik M, Collins JJ: Noise-based switches and amplifiers for gene expression. Proc Natl Acad Sci USA. 2000, 97 (5): 2075-2080. 10.1073/pnas.040411297.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  55. Paulsson J, Berg OG, Ehrenberg M: Stochastic focusing: fluctuation-enhanced sensitivity of intracellular regulation. Proc Natl Acad Sci USA. 2000, 97 (13): 7148-7153. 10.1073/pnas.110057697.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  56. Lewis J: Autoinhibition with transcriptional delay: a simple mechanism for the zebrafish somitogenesis oscillator. Curr Biol. 2003, 13 (16): 1398-1408. 10.1016/S0960-9822(03)00534-7.

    Article  CAS  PubMed  Google Scholar 

  57. Stewart-Ornstein J, Weissman JS, El-Samad H: Cellular noise regulons underlie fluctuations in Saccharomyces cerevisiae. Mol Cell. 2012, 45 (4): 483-493. 10.1016/j.molcel.2011.11.035.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  58. Metropolis N, Rosenbluth AW, Rosenbluth MN, Teller AH, Teller E: Equation of State Calculations by Fast Computing Machines. J Chem Phys. 1953, 21 (6): 1087-1092. 10.1063/1.1699114.

    Article  CAS  Google Scholar 

  59. Iman RL, Helton JC, Campbell JE: An Approach to Sensitivity Analysis of Computer-Models. 1. Introduction, Input Variable Selection and Preliminary Variable Assessment. J Qual Technol. 1981, 13 (3): 174-183.

    Google Scholar 

  60. Zamora-Sillero E, Hafner M, Ibig A, Stelling J, Wagner A: Efficient characterization of high-dimensional parameter spaces for systems biology. BMC systems biology. 2011, 5:

    Google Scholar 

  61. Hua F, Hautaniemi S, Yokoo R, Lauffenburger DA: Integrated mechanistic and data-driven modelling for multivariate analysis of signalling pathways. Journal of the Royal Society, Interface / the Royal Society. 2006, 3 (9): 515-526. 10.1098/rsif.2005.0109.

    Article  PubMed Central  PubMed  Google Scholar 

  62. Salis HM, Mirsky EA, Voigt CA: Automated design of synthetic ribosome binding sites to control protein expression. Nature biotechnology. 2009, 27 (10): 946-950. 10.1038/nbt.1568.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  63. Terzer M, Jovanovic A, Choutko A, Nikolayeva O, Korn A, Brockhoff D, Zurcher F, Friedmann M, Schutz R, Zitzler E: Design of a biological half adder. IET Synthetic Biology. 2007, 1: (1-2):53-58. 10.1049/iet-stb:20079022.

    Article  Google Scholar 

  64. Chen BS, Hsu CY, Liou JJ: Robust design of biological circuits: evolutionary systems biology approach. Journal of biomedicine & biotechnology. 2011, 2011: 304236-

    Google Scholar 

Download references

Acknowledgements

This work was supported by a grant from the National Science Council of Taiwan (NSC grant no. 100-2311-B-001-021). We would like to thank Dr. W.C. Liu for useful comments and Dr. Tom Barkas for English editing.

Declarations

The publication costs for this article were funded by Academia Sinica, Taipei, Taiwan.

This article has been published as part of BMC Bioinformatics Volume 14 Supplement 16, 2013: Twelfth International Conference on Bioinformatics (InCoB2013): Bioinformatics. The full contents of the supplement are available online at http://www.biomedcentral.com/bmcbioinformatics/supplements/14/S16.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ming-Jing Hwang.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

Conceived and designed the experiments: AWTC MJH. Performed the experiments: AWTC. Analyzed the data: AWTC MJH. Wrote the paper: AWTC MJH.

Electronic supplementary material

12859_2013_6865_MOESM1_ESM.PDF

Additional file 1: Table S1: Ranges and values of parameters used for simulations in this study. Table S2: RBSs and PDTs of different efficiency levels used in this study for the structural arrangement RBS-gene-PDT. Table S3: Handbook of kinetic motifs for the transcriptional repression cascade circuit simulated. Figure S1: Analytical analysis of the kinetic parameters of protein CI. Additional illustration: The case of an 'AND'-gate circuit. (PDF 753 KB)

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Chiang, A.W., Hwang, MJ. A computational pipeline for identifying kinetic motifs to aid in the design and improvement of synthetic gene circuits. BMC Bioinformatics 14 (Suppl 16), S5 (2013). https://doi.org/10.1186/1471-2105-14-S16-S5

Download citation

  • Published:

  • DOI: https://doi.org/10.1186/1471-2105-14-S16-S5

Keywords