Volume 13 Supplement 4
Modeling gene regulatory network motifs using statecharts
© Fioravanti et al.; licensee BioMed Central Ltd. 2012
Published: 28 March 2012
Gene regulatory networks are widely used by biologists to describe the interactions among genes, proteins and other components at the intra-cellular level. Recently, a great effort has been devoted to give gene regulatory networks a formal semantics based on existing computational frameworks.
For this purpose, we consider Statecharts, which are a modular, hierarchical and executable formal model widely used to represent software systems. We use Statecharts for modeling small and recurring patterns of interactions in gene regulatory networks, called motifs.
We present an improved method for modeling gene regulatory network motifs using Statecharts and we describe the successful modeling of several motifs, including those which could not be modeled or whose models could not be distinguished using the method of a previous proposal.
We model motifs in an easy and intuitive way by taking advantage of the visual features of Statecharts. Our modeling approach is able to simulate some interesting temporal properties of gene regulatory network motifs: the delay in the activation and the deactivation of the "output" gene in the coherent type-1 feedforward loop, the pulse in the incoherent type-1 feedforward loop, the bistability nature of double positive and double negative feedback loops, the oscillatory behavior of the negative feedback loop, and the "lock-in" effect of positive autoregulation.
We present a Statecharts-based approach for the modeling of gene regulatory network motifs in biological systems. The basic motifs used to build more complex networks (that is, simple regulation, reciprocal regulation, feedback loop, feedforward loop, and autoregulation) can be faithfully described and their temporal dynamics can be analyzed.
In order to understand how biological systems behave, a branch of systems biology [1, 2] called "executable cell biology"  aims to construct computational models which mimic their behavior and which can be used for simulating, in a faithful and cost-effective way, their reactions to external stimuli. The computational model, which is built upon knowledge obtained by performing some in vitro experiments, should be complete (it should be able to reproduce all the experimental data) and correct (it should be possible to reproduce its behavior experimentally).
The correspondence between the in silico model and in vitro observed behaviors is verified by applying model checking techniques . If the model is found to be not consistent with the experimental data, it must be refined and experimentally validated again.
A notable side-effect of the model construction process is that the computational model may suggest new hypotheses about the behavior of the biological system which can then be verified by performing in vitro or in vivo experiments.
A largely studied class of biological systems is constituted by systems which regulate the expression of genes in an organism. Their behavior is often represented by using gene regulatory networks (GRNs), which describe the interactions among genes, proteins and other components at the intra-cellular level. GRNs have been successful among biologists because they constitute an easy to use and intuitive tool which can be used to represent the biological model under consideration. However, their lack of formal semantics prevents their direct use for performing reliable and consistent simulations and for model checking with experimental data.
There have been several attempts to define formal mathematical and computational frameworks for modeling GRNs. They can be classified into quantitative approaches, using differential equations or stochastic models , and qualitative approaches, mostly based on boolean networks , Petri nets [7, 8], and bayesian networks . See  for a detailed analysis and survey of modelling and analysis of GRNs. Motifs have been identified that are significantly overrepresented in biological networks [5, 11–14]. The same motifs have been found in organisms at different levels of complexity, ranging from bacteria to humans. The relationships between different types of motifs and their function have been explored in a number of simple cases, in silico and in vivo [15, 16].
Recently, Shin and Nourani  have used Statecharts (SCs) , a computational framework with a visual language and well-defined semantics, for modeling some small and recurring patterns of interactions in GRNs, called motifs .
Gene Regulatory Network motifs
GRN motifs are pattern of interconnections occurring in real GRNs with a frequence that is significantly higher than that in a randomly generated GRN.
Their high frequency suggests that they play an important role in the GRN function and can, thus, be considered as its building blocks.
The functional role of most common GRN motifs has been extensively studied in some organisms, such as E. coli and other model organisms .
The simple regulation motif
The simple regulation motif is one of the most basic interaction patterns. It is composed of two genes X, Y, where X regulates Y and the interaction is mediated by a signal S X . The signal can act as an inducer molecule that binds X or can represent a modification of X which activates it. Since the regulation of X on Y is either activation or repression and S X can mediate the regulation with either presence or absence, four possible types of motifs can be described.
The feedback loop motif
The feedback loop motif is composed of two genes X and Y, which regulate each other, and their interactions are mediated by a signal S X (for X regulating Y ) and a signal S Y (for Y regulating X). Since the reciprocal regulations between X and Y can be either activations or repressions we have different feedback loop motifs.
A feedback loop motif is double-positive if both the reciprocal regulations of the two genes X and Y are positive, that is, X and Y activate each other (Figure 1B, left). Similarly, a feedback loop motif is double-negative if X and Y repress each other (Figure 1B, middle). If the effects of the reciprocal regulations of the two genes X and Y are of different polarity, that is, X represses Y and Y activates X or viceversa, the feedback loop motif is said to be negative. Due to symmetry, we consider only the former negative feedback loop motif (see Figure 1B, right).
The feedforward loop motifs
The feedforward loop (FFL) motifs are commonly found in many GRNs of widely studied organisms like yeast and E. coli. They are composed of three genes X, Y, and Z, where X regulates Y and Z, and Y regulates Z. For reasons of simplicity from now on we discuss only the motifs where the regulatory effect depends on the presence of the mediating signals, but our findings apply also to the cases of their absence. Each type of regulation can be either activation or repression. Here we use the term coherent (resp. incoherent) to denote the case where the sign of the direct regulation from X to Z is the same (resp. the opposite) as the overall sign of the indirect regulation path through Y, as in the seminal paper of Mangan and Alon . Out of the eight possible FFL motifs, the most frequently encountered ones  are the coherent type-1 FFL motif c1 and the incoherent type-1 FFL motif i1, both shown in Figure 1C.
The combination of the regulations on gene Z by genes X and Y can be given different interpretations . In the following we will assume that such regulations are combined using the AND logic function, as in the arabinose system of E. coli . Although other functions seem to be more appropriate for use in other systems, the AND and OR functions are sufficient to explain the most peculiar properties of FFL.
The autoregulation motifs
The characteristic element of an autoregulation motif is a gene regulating itself. The autoregulation motif is positive if Y activates itself (see par in Figure 1D) and is negative if Y represses itself (see nar in Figure 1D).
SCs extend state transition diagrams by adding concurrency (i.e., the capability of representing a state as made up by smaller components all active at the same time) and hierarchy (i.e., the possibility of representing a state with a set of more detailed substates). The hierarchical structuring capabilities of SCs allow one to model systems at different levels of detail, while concurrency is useful for modeling multiple, mostly independent, portions of a system. Moreover, SCs are compositional, that is, they can be defined in terms of other SCs, thus making the specifications more reusable.
These additional features, if correctly exploited, provide a solution to the scalability problems of other computational modeling techniques like, e.g., those based on boolean networks and Petri nets, whose effectiveness rapidly decreases when applied to larger systems .
We now summarize some of the SCs features that we believe are essential to understand their potential. Please refer to  for more complete and detailed information.
A SC is composed of states and of transitions between states. A state is composite, if it contains other states, and is simple, otherwise. A composite state is parallel if its sub-states are executed concurrently, and is exclusive if exactly one of its sub-states is executed. The overall state of a SC is given by all the atomic states currently under execution.
Transitions are used to specify how a system evolves changing its internal state according to the external stimuli. They can be labeled by events which trigger their activation and the consequent change of state of the system, conditions for their applicability, and actions to be performed during their execution.
SCs have very good software tool support [22–27], which can be used to generate source code (e.g. in Java) whose execution corresponds to the SCs semantics, and to interactively simulate the system execution. SCs have been extensively studied in software and systems engineering, and have demonstrated to be particularly well-suited for modeling and designing reactive systems, that is, systems which evolve reacting to internal or external events, or changed conditions. In the case of GRNs these events can be, for example, the introduction or removal of a protein or of another component.
SCs have also been successfully used to model pancreatic organogenesis in the embryonic mouse , cell fate specification during C. elegans vulval development , and T-cell development in the thymus .
Shin and Nourani have used SCs to model GRN motifs . In their approach, each element (gene, protein, signal) can be in one of the two states: "on", which means that the gene is expressed or that the protein is present and active, and "off", which means that the gene is not expressed or that the protein is not present or present in its inactive form.
Moreover, activating interactions in GRNs are translated to transitions from the "off" state to the "on" state for the gene being activated. Similarly, inhibiting interactions correspond to transitions from the "on" state to the "off" state.
Their SCs model of the coherent simple regulation motifs s1 and s2 is shown in Figure 2B, which in their approach represents also the autoregulation motifs.
Results and discussion
We present an improved method for modeling gene regulatory network motifs by using SCs and we show its application to model a number of motifs. As in the Shin and Nourani  approach we use two states "on" and "off" to model each element with the same meaning.
Transitions in our approach are labeled with a logical formula, expressed in terms of presence or absence of genes and signals, which activates the transition when true. Whenever the transitions between "on" and "off" states are not present in our SCs model of a motif this means that the corresponding elements are the independent variables of the modeled motif and their state is possibly changed as a consequence of events outside the motif itself.
A distinctive and novel feature of our method with respect to the method of Shin and Nourani is that we map the elements which are involved in the regulation to concurrent states. This offers a number of advantages that will be detailed in the following.
We also study the temporal behavior of GRN motifs. Given the discrete nature of SCs, the temporal behavior of SCs models of GRN motifs is somewhat rough, but anyhow allows us to simulate some interesting temporal properties of GRN motifs. We are able to model the delay in the activation and the deactivation of the "output" gene in the coherent type-1 feedforward loop motif (c1 FFL), and the pulse in the incoherent type-1 feedforward loop motif (i1 FFL). We are also able to partially model the temporal dynamics of feedback loop motifs and autoregulation motifs, in the sense that the qualitative behavior is represented but the boolean nature of our SCs based approach does not allow us to model more sophisticated temporal mechanisms which require the use of quantitative aspects, like acceleration and damping.
Model of simple regulation
Our models of the simple regulation motifs s1 and s2 are shown in Figure 3A left and right. In both cases, all the elements involved in the regulation, the genes X and Y and the signal S X are modeled as concurrent states, and, for each of them, we use two states for modeling its presence (and absence). The activation and deactivation of the regulated gene are modeled by two transitions connecting its presence states, which are triggered according to the truth value of logical formulas depending on the presence of the gene X and the signal S X . Note that in the logical formulas the green symbol ∨ represents the logical connective OR while the orange symbol ∧ the logical connective AND. Note also that in the logical formulas for any element X, the expression X = 1 is abbreviated as X and the expression X = 0 is abbreviated as .
Our approach for modeling simple regulation is non-ambiguous, because motifs s1 and s2 are represented by two different SCs. See again Figure 3A for our model and compare it with the ambiguity deriving from Shin and Nourani model shown in Figure 2B, where the same SC is used to describe both s1 and s2. Mapping different motifs onto the same SC is a potential source of problems when the mapping is inverted (i.e., from the SC to motifs) because it is not clear whether the SC should be mapped on both the original motifs (thus, possibly leading to an over-specification) or it should be mapped on only one of them.
In fact, their model does not fully exploit the concurrency features of SC. This determines sub-optimality, because it does not allow to reduce the size of the system. Their method is therefore not scalable: the complexity of their models grows faster than their size. Moreover, since the states of the regulated gene are modeled as substates of the regulating gene, and not as concurrent states, it is not possible to model networks containing genes which reciprocally regulate each other (see the model of feedback loop presented below). Note that these problems of  just described with reference to coherent simple regulations also affect the modeling of the other, more complex, motifs.
Similar considerations also apply to the modeling of the incoherent simple regulation motifs s3 and s4, whose SCs models with our approach are shown in Figure 3C.
Model of feedback loop
The feedback loop motif is not addressed by the modeling approach defined by Shin and Nourani  and we will shortly prove that it cannot be. We first note that the authors themselves observe in the "Further Discussion" section of their paper  that feedback loop motif is not part of their modeling scheme and that they intend to incorporate it in the future. We observe that this is not possible in their method, because it requires the states of the regulated gene to be substates of the states of the regulating gene. Since in the feedback loop motif X and Y act as both regulated and regulating genes, this requirement cannot be fulfilled.
Our modeling approach does not have this limitation because, as already mentioned, the genes and the signals are modeled as concurrent states.
From now we shall discuss also the temporal behavior of each SCs model representing a given in vitro motif so as to determine how closely each model is able to reproduce the corresponding in vitro behavior. Note that since a SC is a discrete model the state of the regulated gene at time instant t + 1 depends on the state of its regulating gene at time instant t. Also note that the results of this investigation are a priori limited by the fact that since our SCs models are boolean any behavior requiring more than two values in the domain cannot be reproduced.
The temporal behavior of the SCs model of the double-positive feedback loop motif is shown in the diagrams reported in Figure 4B. In particular, when X and Y are initially both present or both absent, it exhibits the "joint bistability" behavior , that is X and Y are either both always "off" or both always "on", as shown in Figure 4B (left and middle). But, as you can see in Figure 4B (right), when the initial state for X and Y is different, the temporal behavior, due to the approximation of the boolean domain where only two values are available, is not able to escape from the oscillating pattern to fall into one of the two steady states that are known from the in vitro experiments [5, 31].
Also in this case, our SCs model is able to reproduce the temporal behavior of the motif, that is, X always "on" and Y always "off", or viceversa (this is called "exclusive bistability" in ). The corresponding diagrams are reported in Figure 5B (left and middle). Once again, the roughness of the boolean model does not allow the temporal behavior to be attracted into one of the two steady states when the initial states of X and Y are the same, see Figure 5B (right).
Model of coherent feedforward loop
A diagrammatic representation of the temporal behavior of the considered SCs model is reported in Figure 7B, where it can be observed (right) that there is no delay in the deactivation of Z (Z and Y become both inactive at time instant t = 3 immediately after X disappears at time instant t = 2), but its activation (left) is delayed (only Y is active in the time instant t = 3 right after X appears at time instant t = 2, and Z becomes active only in the step after Y's activation, that is at time instant t = 4).
Model of incoherent feedforward loop
The i1 FFL motif (once again, with the AND combination of X's and Y's regulations on Z) has been used as a model of the galactose system in E. coli  where it produces an impulsive behavior in the regulated gene which first rises very quickly and afterwards soon goes down.
On the other side, our SCs model is not able to express the response acceleration dynamics of the i1 FFL motif with respect to simple regulation , as previously said in the discussion of the intrinsic limitation of the boolean domain. We are currently working on the extension of our SCs-based approach to the more general case of a many-valued discrete domain.
Model of autoregulation
The negative autoregulation motif is a very common and widely studied pattern of regulation.
Experimental results  have shown that it behaves as an accelerator of the gene response (with respect to the simple regulation motif), in presence of a high initial concentration of the self-regulating gene. The opposite behavior is exhibited by the positive autoregulation motif which slows down the production of the gene .
We have presented a Statecharts-based approach for modeling motifs of gene regulatory networks which (i) avoids the representation problems (incompleteness, no-concurrency, ambiguity) of a previous proposal , (ii) is able to model motifs that were not possible to model by following the approach of , (iii) produces more faithful models for the autoregulation motifs than , and (iv) is able to exhibit a temporal dynamics which qualitatively follows the actual biological dynamics.
More specifically, we have been able to represent simple regulation, feedforward loop, feedback loop, and autoregulation, which represent the basic motifs that can be used to model more complex networks. Furthermore, our approach, even if intrinsically boolean and discrete, allows us to give a faithful qualitative description of the temporal behavior in the coherent type-1 feedforward loop motif (c1 FFL), in the incoherent type-1 feedforward loop motif (i1 FFL), in feedback loop motifs, and in the positive autoregulation motif.
We are now planning, as future work, to extend our approach to consider also quantitative information, so as to provide a more realistic executable model of GRN motifs and their temporal dynamics.
List of abbreviations used
Gene Regulatory Network
This work has been supported by the PRIN-MIUR 2008 project SIMBIOSYS. We thank Finn Drabløs for having suggested to investigate Statecharts modeling of GRN motifs and his useful comments on a draft of this paper.
This article has been published as part of BMC Bioinformatics Volume 13 Supplement 4, 2012: Italian Society of Bioinformatics (BITS): Annual Meeting 2011. The full contents of the supplement are available online at http://www.biomedcentral.com/1471-2105/13/S4.
- Kitano H: Systems biology: a brief overview. Science 2002, 295(5560):1662–1664. 10.1126/science.1069492View ArticlePubMedGoogle Scholar
- Ideker T, Galitski T, Hood L: A new approach to decoding life: systems biology. Annu Rev Genomics Hum Genet 2001, 2: 343–372. 10.1146/annurev.genom.2.1.343View ArticlePubMedGoogle Scholar
- Fisher J, Henzinger TA: Executable cell biology. Nature biotechnology 2007, 25(11):1239–1249. 10.1038/nbt1356View ArticlePubMedGoogle Scholar
- Clarke EM Jr, Grumberg O, Peled DA: Model checking. Cambridge, MA, USA: MIT Press; 1999.Google Scholar
- Tyson JJ, Novák B: Functional Motifs in Biochemical Reaction Networks. Annual Review of Physical Chemistry 2010, 61: 219–240. 10.1146/annurev.physchem.012809.103457PubMed CentralView ArticlePubMedGoogle Scholar
- Glass L, Kauffman S: The logical analysis of continuous, non-linear biochemical control networks. Journal of Theoretical Biology 1973, 39: 103–129. 10.1016/0022-5193(73)90208-7View ArticlePubMedGoogle Scholar
- Chaouiya C: Petri net modelling of biological networks. Briefings in Bioinformatics 2007, 8(4):210–219. 10.1093/bib/bbm029View ArticlePubMedGoogle Scholar
- Steggles LJ, Banks R, Shaw O, Wipat A: Qualitatively modelling and analysing genetic regulatory networks: a Petri net approach. Bioinformatics 2007, 23(3):336–343. 10.1093/bioinformatics/btl596View ArticlePubMedGoogle Scholar
- Friedman N, Linial M, Nachman I, Pe'er D: Using Bayesian networks to analyze expression data. Journal of Computational Biology 2000, 7(3):601–620. 10.1089/106652700750050961View ArticlePubMedGoogle Scholar
- Karlebach G, Shamir R: Modelling and analysis of gene regulatory networks. Nat Rev Mol Cell Biol 2008, 9(10):770–780. 10.1038/nrm2503View ArticlePubMedGoogle Scholar
- Milo R, Shen-Orr S, Itzkovitz S, Kashtan N, Chklovskii D, Alon U: Network motifs: simple building blocks of complex networks. Science (New York, N.Y.) 2002, 298(5594):824–827. 10.1126/science.298.5594.824View ArticleGoogle Scholar
- Shen-Orr S, Milo R, Mangan S, Alon U: Network motifs in the transcriptional regulation network of Escherichia coli. Nat Genet 2002, 31: 64–8. 10.1038/ng881View ArticlePubMedGoogle Scholar
- Alon U: Network motifs: theory and experimental approaches. Nat Rev Genet 2007, 8(6):450–461. 10.1038/nrg2102View ArticlePubMedGoogle Scholar
- Sneppen K, Krishna S, Semsey S: Simplified Models of Biological Networks. Annual Review of Biophysics 2010, 39: 43–59. 10.1146/annurev.biophys.093008.131241View ArticlePubMedGoogle Scholar
- Horvath P, Hunziker A, Erdossy J, Krishna S, Semsey S: Timing of gene transcription in the galactose utilization system of Escherichia coli. J Biol Chem 2010, 285(49):38062–8. 10.1074/jbc.M110.152264PubMed CentralView ArticlePubMedGoogle Scholar
- Vidal M, Cusick ME, Barabási ALL: Interactome networks and human disease. Cell 2011, 144(6):986–998. 10.1016/j.cell.2011.02.016PubMed CentralView ArticlePubMedGoogle Scholar
- Shin YJ, Nourani M: Statecharts for Gene Network Modeling. PLoS ONE 2010, 5(2):e9376. 10.1371/journal.pone.0009376PubMed CentralView ArticlePubMedGoogle Scholar
- Harel D: Statecharts: A visual formalism for complex systems. Sci Comput Program 1987, 8: 231–274. 10.1016/0167-6423(87)90035-9View ArticleGoogle Scholar
- Costanzo M, Baryshnikova A, Myers CL, Andrews B, Boone C: Charting the genetic interaction map of a cell. Curr Opin Biotechnol 2010, 22: 66–74.View ArticlePubMedGoogle Scholar
- Mangan S, Alon U: Structure and function of the feed-forward loop network motif. Proceedings of the National Academy of Sciences of the United States of America 2003, 100(21):11980–11985. 10.1073/pnas.2133841100PubMed CentralView ArticlePubMedGoogle Scholar
- Mangan S, Zaslaver A, Alon U: The coherent feedforward loop serves as a sign-sensitive delay element in transcription networks. J Mol Biol 2003, 334(2):197–204. 10.1016/j.jmb.2003.09.049View ArticlePubMedGoogle Scholar
- IBM Rational Rhapsody[http://www-01.ibm.com/software/awdtools/rhapsody/]
- IAR VisualSTATE[http://www.iar.com/en/products/iar-visualstate/]
- Softonic SmartState Studio[http://smartstate-studio.en.softonic.com/]
- Quantum Leaps State Machine[http://www.state-machine.com/]
- IntelliWizard UML StateWizard[http://www.intelliwizard.com/]
- Apache Commons SCXML[http://commons.apache.org/scxml/]
- Setty Y, Cohen IR, Dor Y, Harel D: Four-dimensional realistic modeling of pancreatic organogenesis. Proceedings of the National Academy of Sciences of the United States of America 2008, 105(51):20374–20379. 10.1073/pnas.0808725105PubMed CentralView ArticlePubMedGoogle Scholar
- Fisher J, Piterman N, Stern MJ, Harel D: Computational insights into Caenorhabditis elegans vulval development. Proceedings of the National Academy of Sciences of the United States of America 2005, 102(6):1951–1956. 10.1073/pnas.0409433102PubMed CentralView ArticlePubMedGoogle Scholar
- Efroni S, Harel D, Cohen IR: Toward rigorous comprehension of biological complexity: modeling, execution, and visualization of thymic T-cell maturation. Genome research 2003, 13(11):2485–2497. 10.1101/gr.1215303PubMed CentralView ArticlePubMedGoogle Scholar
- Shoval O, Alon U: SnapShot: Network Motifs. Cell 2010, 143(2):326.e1–326.e2.View ArticlePubMedGoogle Scholar
- Pomerening J, Sontag E, Ferrell J: Building a cell cycle oscillator: hysteresis and bistability in the activation of Cdc2. Nature Cell Biology 2003, 5(4):346–351. 10.1038/ncb954View ArticlePubMedGoogle Scholar
- Mangan S, Itzkovitz S, Zaslaver A, Alon U: The incoherent feed-forward loop accelerates the response-time of the gal system of Escherichia coli. Journal of Molecular Biology 2006, 356(5):1073–1081. 10.1016/j.jmb.2005.12.003View ArticlePubMedGoogle Scholar
- Rosenfeld N, Elowitz MB, Alon U: Negative Autoregulation Speeds the Response Times of Transcription Networks. Journal of Molecular Biology 2002, 323(5):785–793. 10.1016/S0022-2836(02)00994-4View ArticlePubMedGoogle Scholar
- Maeda Y, Sano M: Regulatory dynamics of synthetic gene networks with positive feedback. Journal of Molecular Biology 2006, 359(4):1107–1124. 10.1016/j.jmb.2006.03.064View ArticlePubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.