Skip to main content

SBMLKinetics: a tool for annotation-independent classification of reaction kinetics for SBML models

Abstract

Background

Reaction networks are widely used as mechanistic models in systems biology to reveal principles of biological systems. Reactions are governed by kinetic laws that describe reaction rates. Selecting the appropriate kinetic laws is difficult for many modelers. There exist tools that attempt to find the correct kinetic laws based on annotations. Here, I developed annotation-independent technologies that assist modelers by focusing on finding kinetic laws commonly used for similar reactions.

Results

Recommending kinetic laws and other analyses of reaction networks can be viewed as a classification problem. Existing approaches to determining similar reactions rely heavily on having good annotations, a condition that is often unsatisfied in model repositories such as BioModels. I developed an annotation-independent approach to find similar reactions via reaction classifications. I proposed a two-dimensional kinetics classification scheme (2DK) that analyzed reactions along the dimensions of kinetics type (K type) and reaction type (R type). I identified approximately ten mutually exclusive K types, including zeroth order, mass action, Michaelis–Menten, Hill kinetics, and others. R types were organized by the number of distinct reactants and the number of distinct products in reactions. I constructed a tool, SBMLKinetics, that inputted a collection of SBML models and then calculated reaction classifications as the probability of each 2DK class. The effectiveness of 2DK was evaluated on BioModels, and the scheme classified over 95% of the reactions.

Conclusions

2DK had many applications. It provided a data-driven annotation-independent approach to recommending kinetic laws by using type common for the kind of models in combination with the R type of the reactions. Alternatively, 2DK could also be used to alert users that a kinetic law was unusual for the K type and R type. Last, 2DK provided a way to analyze groups of models to compare their kinetic laws. I applied 2DK to BioModels to compare the kinetics of signaling networks with the kinetics of metabolic networks and found significant differences in K type distributions.

Peer Review reports

Background

Reaction networks are widely used in systems biology as mechanistic models that can reveal governing principles of biological systems.Footnote 1 The choice of kinetic laws for reactions (e.g., mass action kinetics, Michaelis–Menten kinetics) is crucial for the development of mechanistic models of biological systems. Computational studies of systems biology often involve the analysis of synthetically generated networks (e.g., [1, 2]). As noted in [3, 4], the choice of kinetic laws for these networks is as important as the set of reactions (the reactants, products, and modifiers for reactions) since reactions alone are not sufficient to determine dynamics [5]. Indeed, many computational studies only consider mass action kinetics, which may be inadequate [1]. In addition, kinetics are crucial considerations in parameter inference and model identification [6].

There is a wide diversity of kinetic laws. For example, allosteric and cooperative kinetics are important considerations in some biological systems [7, 8], and there are generalizations of these laws [9, 10], such as convenience kinetics [11]. Complications arise if some inhibitors or activators target the active site (non-allosteric). A further consideration is an order in which elemental reactions occur to form complexes such as compulsory order and ping-pong [12,13,14].

A significant motivation for this work was addressing the challenges modelers face with choosing appropriate kinetic laws. A detailed example of choosing kinetic laws is given for the models of glycolysis [15] and Trypanosoma models [16, 17]. Further, King–Altman [18] considers a much more complex reaction mechanism than deriving Michaelis–Menten kinetics using the steady-state assumption. Kinetic laws are chosen based on the best knowledge of the underlining kinetic mechanism. Some books describe kinetic laws, such as Segel’s enzyme kinetics [12] and others [13, 14].

Despite the examples, there remains a significant challenge with applying the theory in practice, especially for less experienced modelers. To address this, programs such as COPASI [19] and CellDesigner [20], provide pre-defined lists of kinetic laws. SBMLsqueezer 2 [21] goes further by making recommendations for kinetic laws in two ways. One way is a classification scheme that “depends on the annotations associated with each reactant, reaction and modulation”, which are “incorporated into the Systems Biology Markup Language (SBML) [22] specifications in the form of Systems Biology Ontology (SBO) [23]” [24]. “The most important sources of information in determining these categories are Minimal Information Required In the Annotation of Models (MIRIAM) [25] and SBO annotations” [21]. The other way also depends on annotations to match reactions in the SBAIO-RK [26] database to obtain kinetic laws. I emphasize that these two ways are limited because of the heavy dependence on annotations. Further, the authors note that even with annotations, there are serious limitations in that “the extraction from SABIO-RK depends on existing biochemical data and might therefore not always yield results.” Here, I expanded one aspect of the first way taken in SBMLsqueezer 2 to find similar reactions, whose core is to classify reactions. In SBMLsqueezer 2, this relies on annotations which include SBO and MIRIAM.

In this article, I developed a classification scheme that did not depend on annotations. Existing approaches to determining similar reactions rely heavily on having good annotations of chemical species, a condition that is often unsatisfied in model repositories such as BioModels. This article introduced 2DK, a new scheme for classifying reactions based on a two dimension structure: kinetics type (K type) and reaction type (R type). K types were mutually exclusive kinds of kinetic laws such as zeroth order, mass action, Michaelis–Menten, and Hill kinetics. R types classified reactions by the number of distinct reactants and products. I developed a Python-based cross-platform tool, SBMLKinetics, that constructed reaction classifications by inputting a collection of models and outputting the probability of each 2DK class (combinations of K type and R type).

I validated the 2DK classification scheme using BioModels Database. I further described how 2DK could provide a data-driven approach, with BioModels Database as an example, to recommending kinetic laws and providing an alert of unusual kinetic laws. Finally, I applied BioModels Database as an example again to compare the kinetics of signaling networks with the kinetics of metabolic networks. The results showed that the signaling networks were dominated by uni-directional mass action kinetics whereas metabolic networks had a proportionally larger number of regulatory kinetics in the form of kinetic laws that had a fractional representation. This is reasonable because metabolic models are derived heavily from underlying enzyme mechanisms whereas signaling pathways are generally simpler binding/unbinding reactions or gross simplifications of the kinetics because very little is known about the mechanism.

Methods

I developed an annotation-independent classification scheme for kinetic laws based on patterns in existing reaction networks. The focus was on the standard community of SBML. The approach classified reactions along two dimensions. The first was a characterization of the algebraic expression of the kinetic law. I referred to this as the kinetics type (K type) of the reaction. The second was a simplification of reaction stoichiometry, namely the number of distinct reactants and the number of distinct products. I referred to this as the reaction type (R type) of the reaction.

Define the two-dimensional kinetics classification scheme (2DK)

The two-dimensional kinetics classification scheme (2DK) organized reactions along two dimensions: kinetics type (K type) and reaction type (R type).

Define the kinetics type (K type)

On one hand, K type was defined mainly by the algebraic expressions of kinetic laws. For example, the kinetics with a reaction of \(A + B \rightarrow C\) and with its kinetic law \(K_1\) is different from the same reaction \(A + B \rightarrow C\) but with another kinetic law \(K_1\cdot A\cdot B\). On the other hand, K type also included some information from the reactions. For example, it was necessary to consider whether a species inside the kinetic law was a reactant or not. To evaluate the kinetics statistically, I classified the kinetics into commonly used and mutually exclusive ten types with defined categories as below, referring to Ontology Search (OLS) [27].

  • Zeroth order (ZERO) the number of species in the kinetic law was zero;

  • Uni-directional mass action (UNDR) the kinetic law was a single product of terms and all species in the kinetic law were reactants;

  • Uni-term with the moderator (UNMO) the kinetic law was a single product of terms and not UNDR, namely at least one species in the kinetic law was not a reactant;

  • Bi-directional mass action (BIDR) the kinetic law was the difference between two products of terms, and the first product of terms contained species as all the reactants while the second product of terms contained species as all the products;

  • Bi-terms with the moderator (BIMO) the kinetic law was the difference between two products of terms and NOT BIDR;

  • Michaelis–Menten kinetics without explicit enzyme (MM) the kinetic law was in the format of Michaelis–Menten expressions without an explicit enzyme;

  • Michaelis–Menten kinetics with an explicit enzyme (MMCAT) the kinetic law was in the format of Michaelis–Menten expressions with an explicit enzyme;

  • Hill equation (HILL) the kinetic law was in the format of Hill equations;

  • Fraction format other than MM, MMCAT, and HILL (FR) the kinetic law was in the format of fraction with at least one species in the denominator and NOT MM, MMCAT, or HILL;

  • Not classified (NA) not classified kinetics.

Table 1 provided an example for each certain K type. In particular, the type FR included generalized kinetics with the kinetic law in the format of fraction based on but beyond Michaelis–Menten and Hill kinetics. All the information in Table 1 other than K type was from the SBML files in the BioModels Database. There are four columns: “K Type”, “BIOMD” (BioModel identifier number), “Reaction” and “Kinetic law”. In the column of “BIOMD”, i.e., 5 means BIOMD0000000005. In the “Kinetic law” column, all the species in the kinetic law are in bold. As a comparison, all the parameters are not in bold. For instance, variable compartment sizes were considered as parameters. The K types did not consider the values of the parameters; therefore, units were not considered here either.

Table 1 Kinetics examples for ten defined kinetics types (K type)

OLS lists the mathematical expressions of kinetic laws in SBO as four types: mass action kinetic laws, modular kinetic laws, enzymatic kinetic laws, and Hill-type kinetic laws. Table 2 described how the defined K types correspond to the mathematical expressions of kinetic laws in SBO.

Table 2 Comparison between kinetics types (K types) and Systems Biology Ontology (SBO) in Ontology Search (OLS)

Define the reaction type (R type)

R type was defined by the structure of reactions. It was a simplification of reaction stoichiometry in that only the number of distinct reactants and products were considered. For example, two reactants and one product are in the reaction of \(A + B \rightarrow C\). I defined R type quantitatively with a pair of the number of reactants (R = 0, 1, 2, > 2) and products (P = 0, 1, 2, > 2). As a comparison, the stoichiometry matrix indicates reactants and products by negative and positive signs respectively. The 2DK classification was a combination of the K type and the R type.

Code structure and statistical analysis

I developed a Python package called SBMLKinetics that read a collection of SBML models and then did statistical analysis for the 2DK clarifications. There were four major workflow steps in SBMLKinetics implemented by three Python scripts as shown in Fig. 1.

Fig. 1
figure 1

Code structure and workflow steps of SBMLKinetics

Expand the kinetic law formula if applicable

Before analyzing kinetics, I expanded the kinetic law formula if it was in the format with a function name. In the SBML files, some kinetic laws are expressed via specific function names. Therefore, the function names are supposed to get replaced by certain function bodies. For instance, the kinetic law is sometimes defined by f(t). Then, it is necessary to expand f(t) to its explicit equation \(at+b\) based on the definition of \(f(x) = ax + b\) in the SBML file.

Analysis of the kinetic laws and reactions

K types were determined by a collection of eight kinetics properties (K properties) listed below that were ascertained from analyzing the reaction’s kinetic law.

  1. (a)

    the number of species in the kinetic law;

  2. (b)

    whether the kinetic law was a single product of terms;

  3. (c)

    whether the kinetic law was the difference between two products of terms;

  4. (d)

    whether the first (and the second) product of terms contained species as all the reactants (and products);

  5. (e)

    whether the kinetic law was in the format of Michaelis–Menten expressions with a single reactant in the numerator;

  6. (f)

    whether the kinetic law was in the format of Michaelis–Menten expressions with the product of a single reactant and another species in the numerator;

  7. (g)

    whether the kinetic law was in the format of Hill equation;

  8. (h)

    whether the kinetic law was in the fraction format with at least one species in the denominator.

The logic for determining the K type based on K properties was described in Table 3. For example, if the number of species in the kinetic law was zero according to K property a, then the K type was ZERO. If both K properties b and d held, then the K type was UNDR. The analysis proceeded column by column in Table 3. Note that the classifications were mutually exclusive since all pairs of columns differed in at least one non-empty row. For example, UNDR and UNMO differed in the K property d. To illustrate the above analysis, consider the kinetic law cell\(\cdot\)CP\(\cdot\)k9 of the reaction CP \(\rightarrow\) C2 in the second row of Table 1. The K property b (it was a single product of terms) and d (all reactants were in the kinetic law) both held. Therefore, I saw that its K type was UNDR based on Table 3.

Table 3 The determination of kinetics types (K types) by kinetics properties (K properties)

To classify the R types, I considered the number of distinct reactants and products. R type was quantitatively represented by the number of reactants (R = 0, 1, 2, > 2) and products (P = 0, 1, 2, > 2).

Statistical analysis for different kinetics

After analyzing the kinetic laws with the strategies in steps 1 and 2, I did a statistical analysis to obtain the following information.

  • Detailed Information for each reaction including BioModel identifier number, reaction id, reaction, kinetic law, and K type.

  • Detailed information for each BioModel including BioModel identifier number, the number of reactions, and the distributions of ten K types per model.

Application programming interface (API) for users

Finally, users could access the results via APIs in the following perspectives.

  • Query distributions of K type and R type.

  • Query elements of (top) K type and R type.

  • Plots for the distributions of K type and R type.

All the API details were documented and available at GitHub (https://sunnyxu.github.io/SBMLKinetics/).

Uniqueness

The cross-platform Python-based tool, SBMLKinetics, constructed reaction classifications by inputting a collection of SBML models and outputting the probability of each 2DK class (combinations of K type and R type). The kinetic analysis was based on python-libSBML [28], which supports SBML in levels 1–3 [29,30,31]. Therefore, SBMLKinetics could rely on any SBML database in levels 1–3.

To compare with SBMLsqueezer 2 [21] which is the only tool for large-scale biochemical kinetic models, I summarized its major differences from SBMLKinetics in Table 4. SBMLsqueezer 2 uses a classification scheme, called de novo creation method, that “considers the annotation of all participating reactants, products, and regulators” [24]. It contains a connection to the kinetics database to recommend kinetic laws. In detail, its extraction of kinetic laws from the SABIO-RK [26] database is based on the similarity of annotations which cannot always yield results [21]. Briefly, SBMLsqueezer uses a creative method to classify kinetics that relies heavily on the information from annotations. However, annotations are not always available in SBML files in model repositories such as BioModels.

Table 4 Comparison between SBMLKinetics and SBMLsqueezer 2

The tool of SBMLKinetics made use of an annotation-independent classification scheme to deal with the kinetics analysis based on the SBML database, which could include all the current and future information from the SBML model database and experimental results. The categories in the de novo creation of SBMLsqueezer 2 are not exclusive to each other as the 2DK of SBMLKinetics. In comparison with the de novo creation, 2DK had covered all the categories of the de novo creation except the perspectives of reversible or irreversible reactions. Table 5 provided more details.

Table 5 Comparison between de novo creation in SBMLsqueezer 2 and 2DK in SBMLKinetics

Results and validation

I considered a classification scheme to be useful if: (a) the classification categories provided useful insights, and (b) the scheme could classify a large fraction of items. The discussion in the “Methods” section supported (a) that 2DK classifications aligned well with easily identifiable properties of reactions. Here, I addressed (b). In particular, I calculated the coverage of 2DK, namely the fraction of the kinetic laws of reactions in curated BioModels that could be classified using 2DK, which turned out to be approximately 95%. In the following subsections, I presented more detailed information about K type and R type distributions.

Kinetics type distribution

There was possible kinetics like zeroth order, mass action, Michaelis–Menten, Hill kinetics, and others. See the “Methods” section for details. Figure 2 characterized the kinetics used in the BioModels Database. The classification of the ten types was exclusive. Therefore, the sum of all the probabilities was one. Figure 2 showed the distribution of K types from 931 curated BioModels with 30,592 reactions. The top K type was UNDR, and 4.37% reactions were not classified. The relatively small number of unclassified reactions did suggest that the selected classification schemes were reasonable. The running time was 9.2 h. All computations reported were done using an Intel i7 9700 processor running at 3.00 GHz with 32 GB RAM on Windows 10.

Fig. 2
figure 2

Kinetics type distribution. The blue bars investigated the kinetics type distribution of the average of all the reactions from all the models. The orange bars indicated the distribution of the average of reactions for each model. The trend for the two color bars was qualitatively similar. The error bars on the top of the orange bars represented the uncertainty among different models

Reaction type distribution

To classify the reaction types (R types), I considered the number of distinct reactants and products. R type was quantitatively represented by the number of distinct reactants (R = 0, 1, 2, > 2) and products (P = 0, 1, 2, > 2). The classification of all the reaction types was exclusive. The sum of all the probabilities was one, which meant that I covered all the curated BioModels from the perspective of R type. Figure 3 showed the distribution of R types from 931 curated BioModels with 30,592 reactions. The top R type was with a single reactant and a single product.

Fig. 3
figure 3

Reaction type distribution. A investigated the reaction type distribution of the average of all the reactions from all the models. B indicated the distribution of the average of reactions for each model. The trend for the two subplots was qualitatively similar

There are several models in the Database, and there are several reactions in each model. In Fig. 2, the blue bars investigated the K type distribution of the average of all the reactions from all the models. In detail, I counted the number of reactions in all the models as \(R_{all{\text{-}}models}\). Then, I counted the percentage of reactions for each K type among all the reactions (\(R_{all{\text{-}}models}\)). The orange bars indicated the distribution of the average of reactions for each model. The error bars on the top of the orange bars represented the uncertainty among different models, which was relatively small. In detail, I first counted the number of reactions in each model as \(R_{per{\text{-}}model}\). Then, I counted the percentage of reactions for each K type among the reactions in the certain model (\(R_{per{\text{-}}model}\)). The value of the distribution was the average of the percentage among all the models. The error bar was the standard error among all the models. For each K type distribution in Fig. 2, the trend for the two color bars was qualitatively similar with relatively small error bars. Therefore, the statistics were meaningful for the K type distributions. In Fig. 3, the trend for the two subplots was qualitatively similar which did not depend on a certain model. Therefore, the R type distribution was meaningful too.

Applications and discussion

2DK could provide a data-driven approach to recommending kinetic laws and providing an alert of unusual kinetic laws. I applied the 2DK method and the tool SBMLKinetics to the BioModels Database as an example of analyzing kinetics from different reaction networks, although the tool could be applied to any repository of SBML kinetics models. The examples could illustrate the potential applications of the method and how to use the tool. In the following subsections, I indicated how the variety of K type distributions depended on different R types. In addition, I compared two subsets of BioModels including signaling networks and metabolic networks.

Kinetics type distributions for different reaction types

I investigated the distribution of kinetics based on reaction types and found substantial differences among them (Fig. 4). Focusing on the four types in the center with the most reactions involved (R = 1, P = 1; R = 2, P = 1; R = 1, P = 2; R = 2, P = 2), these four subplots were quite different from each other. The top K type with one reactant was UNDR significantly, while the distributions became more evenly with two reactants.

Fig. 4
figure 4

Kinetics type distribution for different reaction types. For each reaction type, represented by the number of distinct reactants and products (marked as the black text in each subplot), a kinetics type distribution was given. The blue text in the top left corner represented the percentage of reactions involved in a certain reaction type. The orange text in the top right corner represented the percentage of reactions per model. The blue bars investigated the kinetics type distribution of the average of all the reactions from all the models. The orange bars indicated the distribution of the average of reactions for each model. The trend for color bars was qualitatively similar for each reaction type. The error bars on the top of the orange bars represented the uncertainty among different models

Figure 4 investigates the significant differences in K type distributions among R types. The blue text in the top left corner represented the percentage of reactions involved in the certain R type, which was more clearly illustrated in Fig. 3A. The orange text in the top right corner represented the percentage of reactions per model, which was more clearly indicated in Fig. 3B. Following the observation of the blue and orange texts, the most frequent six R types were (R = 1, P = 0; R = 0, P = 1; R = 1, P = 1; R = 2, P = 1; R = 1, P = 2; R = 2, P = 2). In Fig. 4, the K type distributions were non-symmetric depending on different R types, which meant the effects from the number of distinct reactants and products were not equal. Based on Fig. 4, researchers could tell which R type had the highest or lowest K type. For example, the R type with R = 2, P = 1 had the highest BIDR among all the R types. While there was the highest NA from the R type with R > 2, P = 1. It was also interesting to compare per column (with the same number of reactants) or row (with the same number of products). For instance, the third row indicated that the FR became more frequent as the number of reactants increased with a constant number of the product (P = 2).

Comparison between different subsets of BioModels

This characterization considered the biology being studied as well as the nature of the reactions themselves. For instance, I compared the K type distributions for signaling and metabolic networks in the BioModels Database as an example. For the details of selecting the two subsets, see the Availability of data and materials under the Declarations section. I found substantial differences between the two types of networks in Fig. 5. Figure 5A showed the distribution of K types from 288 signaling networks with 13,572 reactions. The top K type was UNDR, and 1.84% reactions were not classified. The running time was 6.9 h. Figure 5B showed the distribution of K types from 168 metabolic networks with 6,904 reactions. The top K type was FR, and 9.85% reactions were not classified. The running time was 8.1 h. The K type distribution of signaling networks was similar to the whole curated BioModels in Fig. 2. However, the metabolic networks were quite different with high frequencies of ZERO and FR.

The significant difference between signaling networks and metabolic networks could come from regulatory interactions. The high frequency of ZERO and FR from metabolic networks was the kinetics which could be very simple without any species inside the kinetic law or a very complicated kinetic law that had a fractional representation.

Fig. 5
figure 5

Kinetics type distribution for signaling networks (A) and metabolic networks (B). The blue bars investigated the kinetics type distribution of the average of all the reactions from all the models. The orange bars indicated the distribution of the average of reactions for each model. The error bars on the top of the orange bars represented the uncertainty among different models. The trend for the two color bars was qualitatively similar for each subplot. However, there was a significant difference in the kinetics type distributions between the signaling and metabolic networks

The different K type distributions among different data sets, if applied to synthetic random reaction networks, could provide more representative kinetic laws for these studies. As an example to indicate how this method could contribute to generating synthetic reaction networks, I made use of the results from the BioModels Database as an example. However, users could generate a distribution based on their database for further research. Based on the result of Fig. 2 from curated BioModels, users could select UNDR with the highest probability as the general mass action kinetics to model synthetic reaction networks. Users could also follow its distribution to assign the probability for each type of kinetics while generating random synthetic reaction networks. In detail, users could generate reaction networks by assigning the probability of certain kinetics, i.e., ZERO, UNDR, UNMO, etc. Researchers commonly use simplified mass action kinetics as a first trial while modeling synthetic reaction networks [1], therefore, users could choose a certain distribution referring to Fig. 4 if they had already known the R type information. For example, if there were one reactant and one product in the reaction, UNDR was the most common kinetics to choose. While BIDR was more likely to get chosen for the case with two reactants and one product. If users wanted to model synthetic signaling networks instead of metabolic networks, they could refer to the distributions of Fig. 5A instead of Fig. 5B.

Conclusions

SBMLKinetics could analyze any data sets with SBML files as input and generate the analysis of kinetics. Here, I used the BioModels Database as an SBML database example to show the validation and potential applications of 2DK. However, users could use their experimental data or another model database to look for natural biological properties. See the Availability of data and materials under the Declarations section.

Therefore, 2DK could provide a data-driven approach to recommending kinetic laws and providing an alert of unusual kinetic laws. This work had the potential application to computational studies based on random reaction networks so that more representative kinetic laws were used in these studies. As I had discussed the relationship between K type and R type, this tool could suggest the probabilities of K type for each R type. I could also observe how the K type distributions vary according to different R types. The methods could make suggestions to choose K type distributions depending on different models, for instance, signaling versus metabolic networks. Statistically, the probability distributions of K types could be a prior for kinetic law generations, as Bayesian inference has been discussed about kinetics before [32, 33].

What if the kinetic laws inside a model were significantly different from the distributions of the K type or R type calculated by the reference models? It could mean that the model had errors in its kinetics. At a minimum, the presence of such differences should be explored by the modeler. But such differences might also reflect substantially different biology or chemistry that was under study.

SBMLKinetics could be useful to pursue biological insights from updating models or experimental databases. For example, it is essential to compare the kinetics among different data sets, i.e., different organisms. The future potential work could be further applications of 2DK. Applications could include a complete recommending system for specific kinetic laws, detecting errors in kinetic laws, and seeking the biology insights of models based on such classifications.

Availability of data and materials

SBMLKinetics was a publicly available Python package (https://pypi.org/project/SBMLKinetics/) and was licensed under the liberal MIT open-source license. The package was available for all major platforms. The source code had been deposited on GitHub (https://github.com/SunnyXu/SBMLKinetics). Users could install the package using the standard pip installation mechanism: pip install SBMLKinetics. The package was fully documented as https://sunnyxu.github.io/SBMLKinetics/. The specific version of the tool SBMLKinetics used for the analysis and results presented in the article was version 1.0.1. As BioModels Database is changing and improving over time, all the data sets to produce the results and analysis in the article were available in the GitHub repository (https://github.com/SunnyXu/SBMLKinetics/tree/master/data). Initially, the metabolic datasets were downloaded from the BioModels Database website (https://www.ebi.ac.uk/biomodels/), by clicking on the “Manually Curated” button, selecting the “SBML” model format, and typing “AND “Metabolic” in the searching engine. For signaling networks, both “Signaling” and “Signalling” were considered.

Notes

  1. Abbreviations: The two-dimensional kinetics classification scheme (2DK) organized reactions along two dimensions: kinetics type (K type) and reaction type (R type). (I) Kinetics type (K type) . K type includes ten types: ZERO: Zeroth order; UNDR: Uni-directional mass action; UNMO: Uni-term with the moderator; BIDR: Bi-directional mass action; BIMO: Bi-terms with the moderator; MM: Michaelis–Menten kinetics without an explicit enzyme; MMCAT: Michaelis–Menten kinetics with an explicit enzyme; HILL: Hill kinetics; FR: Kinetic law in the fraction format other than MM, MMCAT, or HILL; NA: not classified. (II) Reaction type (R type). R type was quantitatively represented by the number of distinct reactants (R = 0, 1, 2, >2) and products (P = 0, 1, 2, >2).

References

  1. Riva S, Cazzaniga P, Nobile M, Spolaor S, Rundo L, Besozzi D, Tangherloni A. SMGen: a generator of synthetic models of biochemical reaction networks. Symmetry. 2022;14:119. https://doi.org/10.3390/sym14010119.

    Article  CAS  Google Scholar 

  2. Schaffter T, Marbach D, Floreano D. GeneNetWeaver: in silico benchmark generation and performance profiling of network inference methods. Bioinformatics (Oxford, England). 2011;27:2263–70. https://doi.org/10.1093/bioinformatics/btr373.

    Article  CAS  PubMed  Google Scholar 

  3. Shen-Orr S, Milo R, Mangan S, Alon U. Network motifs in the transcriptional regulation network of Escherichia coli. Nat Genet. 2002;31:64–8. https://doi.org/10.1038/ng881.

    Article  CAS  PubMed  Google Scholar 

  4. Alon U. Network motifs: theory and experimental approaches. Nat Rev Genet. 2007;8:450–61. https://doi.org/10.1038/nrg2102.

    Article  CAS  PubMed  Google Scholar 

  5. Ronen M, Rosenberg R, Shraiman B, Alon U. Assigning numbers to the arrows: parameterizing a gene regulation network by using accurate expression kinetics. Proc Natl Acad Sci USA. 2002;99:10555–60. https://doi.org/10.1073/pnas.152046799.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  6. Loskot P, Atitey K, Mihaylova L. Comprehensive review of models and methods for inferences in bio-chemical reaction networks. Front Genet. 2019;10:549.

    Article  PubMed  PubMed Central  Google Scholar 

  7. Christopoulos A, Lanzafame A, Mitchelson F. Allosteric interactions at muscarinic cholinoceptors. Clin Exp Pharmacol Physiol. 1998;25(3–4):185–94. https://doi.org/10.1111/j.1440-1681.1998.t01-4-.x.

    Article  CAS  PubMed  Google Scholar 

  8. Nussinov R, Tsai CJ, Liu J. Principles of allosteric interactions in cell signaling. J Am Chem Soc. 2014;136:17692–701. https://doi.org/10.1021/ja510028c.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  9. Liebermeister W, Uhlendorf J, Klipp E. Modular rate laws for enzymatic reactions: thermodynamics, elasticities and implementation. Bioinformatics (Oxford, England). 2010;26:1528–34. https://doi.org/10.1093/bioinformatics/btq141.

    Article  CAS  PubMed  Google Scholar 

  10. Rohwer J, Hanekom A, Hofmeyr J-H. A universal rate equation for systems biology. 2007, pp. 175–187.

  11. Liebermeister W, Klipp E. Bringing metabolic networks to life: convenience rate law and thermodynamic constraints. Theor Biol Med Model. 2006;3:41. https://doi.org/10.1186/1742-4682-3-41.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  12. Wagner M. Enzyme kinetics, behavior and analysis of rapid equilibrium and steady-state enzyme systems (Segel, Irwin H.). J Chem Educ. 1976;53(11):472. https://doi.org/10.1021/ed053pA472.

    Article  Google Scholar 

  13. Bisswanger H. Enzymkinetik: theorie und methoden. Berlin: Wiley-Blackwell; 2000.

    Book  Google Scholar 

  14. Cornish-Bowden A. Fundamentals of enzyme kinetics. London: Portland Press Ltd.; 2004.

    Google Scholar 

  15. Snoep J. The silicon cell initiative: working towards a detailed kinetic description at the cellular level. Curr Opin Biotechnol. 2005;16:336–43. https://doi.org/10.1016/j.copbio.2005.05.003.

    Article  CAS  PubMed  Google Scholar 

  16. Bakker B, Michels P, Opperdoes F, Westerhoff H. Glycolysis in bloodstream form Trypanosoma brucei can be understood in terms of the kinetics of the Glycolytic enzymes. J Biol Chem. 1997;272:3207–15. https://doi.org/10.1074/jbc.272.6.3207.

    Article  CAS  PubMed  Google Scholar 

  17. Helfert S, Estévez AM, Bakker B, Michels P, Clayton C. Roles of triosephosphate isomerase and aerobic metabolism in Trypanosoma brucei. Biochem J. 2001;357(Pt 1):117–25.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  18. Lam CF, Priest DG. Enzyme kinetics: systematic generation of valid King–Altman patterns. Biophys J. 1972;12(3):248–56. https://doi.org/10.1016/S0006-3495(72)86084-3.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  19. Hoops S, Sahle S, Gauges R, Lee C, Pahle J, Simus N, Singhal M, Xu L, Mendes P, Kummer U. COPASI—a complex pathway simulator. Bioinformatics (Oxford, England). 2007;22:3067–74. https://doi.org/10.1093/bioinformatics/btl485.

    Article  CAS  Google Scholar 

  20. Funahashi A, Jouraku A, Matsuoka Y, Kitano H. Integration of cell designer and SABIO-RK. In Silico Biol. 2007;7:81–90.

    Google Scholar 

  21. Dräger A, Zielinski D, Keller R, Rall M, Eichner J, Palsson B, Zell A. SBMLsqueezer 2: context-sensitive creation of kinetic equations in biochemical networks. BMC Syst Biol. 2015;9:68. https://doi.org/10.1186/s12918-015-0212-9.

    Article  PubMed  PubMed Central  Google Scholar 

  22. Keating S, Waltemath D, Helikar T, Hucka M, et al. SBML level 3: an extensible format for the exchange and reuse of biological models. Mol Syst Biol. 2020;16:e9110.

    Article  PubMed  PubMed Central  Google Scholar 

  23. Courtot M, Juty N, Knüpfer C, Waltemath D, Zhukova A, Dräger A, Dumontier M, Finney A, Golebiewski M, Hastings J, Hoops S, Keating S, Kell D, Kerrien S, Lawson J, Lister A, Lu J, Machne R, Mendes P, Le Novere N. Controlled vocabularies and semantics in systems biology. Mol Syst Biol. 2011;7:543. https://doi.org/10.1038/msb.2011.77.

    Article  PubMed  PubMed Central  Google Scholar 

  24. Dräger A, Hassis N, Supper J, Schroeder A, Zell A. SBMLsqueezer: a cell designer plug-in to generate kinetic rate equations for biochemical networks. BMC Syst Biol. 2008;2:39. https://doi.org/10.1186/1752-0509-2-39.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  25. Le Novere N, Finney A, Hucka M, Bhalla U, Campagne F, Crampin E, Halstead M, Klipp E, Mendes P, Nielsen P, Sauro H, Shapiro B, Snoep J, Spence H, Wanner B. Minimum information requested in the annotation of biochemical models (MIRIAM). Nat Biotechnol. 2006;23:1509–15. https://doi.org/10.1038/nbt1156.

    Article  CAS  Google Scholar 

  26. Wittig U, Kania R, Golebiewski M, Rey M, Shi L, Jong L, Algaa E, Weidemann A, Sauer-Danzwith H, Mir S, Krebs O, Bittkowski M, Wetsch E, Rojas I, Müller W. SABIO-RK–database for biochemical reaction kinetics. Nucleic Acids Res. 2011;40:790–6. https://doi.org/10.1093/nar/gkr1046.

    Article  CAS  Google Scholar 

  27. Jupp S, Burdett T, Leroy C, Parkinson HE. A new ontology lookup service at EMBL-EBI. In: SWAT4LS. 2015.

  28. Bornstein BJ, Keating SM, Jouraku A, Hucka M. LibSBML: an API library for SBML. Bioinformatics. 2008;24(6):880–1.

    Article  CAS  PubMed  Google Scholar 

  29. Hucka M, Finney A, Sauro HM, Bolouri H. Systems biology markup language (SBML) level 1: structures and facilities for basic model definitions. California Institute of Technology. Available from COMBINE at http://identifiers.org/combine.specifications/sbml.level-1.version-2. 2003. http://identifiers.org/combine.specifications/sbml.level-1.version-2.

  30. Hucka M, Bergmann FT, Dräger A, Hoops S, Keating SM, Le Novère N, Myers CJ, Olivier BG, Sahle S, Schaff JC, Smith LP, Waltemath D, Wilkinson DJ. Systems biology markup language (SBML) level 2 version 5: structures and facilities for model definitions. J Integr Bioinform. 2015;12(2):271. https://doi.org/10.2390/biecoll-jib-2015-271.

    Article  PubMed  Google Scholar 

  31. Hucka M, Bergmann FT, Chaouiya C, Dräger A, Hoops S, Keating SM, König M, Le Novère N, Myers CJ, Olivier BG, Sahle S, Schaff JC, Sheriff R, Smith LP, Waltemath D, Wilkinson DJ, Zhang F. The systems biology markup language (SBML): language specification for level 3 version 2 core release 2. J Integr Bioinform. 2019;16(2):20190021. https://doi.org/10.1515/jib-2019-0021.

    Article  PubMed  PubMed Central  Google Scholar 

  32. Golightly A, Wilkinson D. Bayesian inference for stochastic kinetic models using a diffusion approximation. Biometrics. 2005;61:781–8. https://doi.org/10.1111/j.1541-0420.2005.00345.x.

    Article  CAS  PubMed  Google Scholar 

  33. Galagali N, Marzouk Y. Bayesian inference of chemical kinetic models from proposed reactions. Chem Eng Sci. 2015;123:170–90. https://doi.org/10.1016/j.ces.2014.10.030.

    Article  CAS  Google Scholar 

Download references

Acknowledgements

J.X. thanks Joseph L. Hellerstein for introducing the problems considered in this paper, many stimulating discussions, his comments on the manuscript, and a grammar check. J.X. appreciates Herbert M. Sauro’s discussions regarding kinetics and synthetic reaction networks cited as the references, and for his comments on the early stage of the manuscript. J.X. also thanks Lucian Smith for the discussion about the kinetic law formula expansion based on libSBML. Finally, J.X. appreciates the valuable comments from the reviewers.

Funding

J.X. who performed the research reported in this article was supported by NIBIB of the National Institutes of Health under award number U24EB028887.

Author information

Authors and Affiliations

Authors

Contributions

JX implemented the project, did the data analysis, and was the main developer of the Python package. JX wrote the manuscript.

Corresponding author

Correspondence to Jin Xu.

Ethics declarations

Ethics approval and consent to participate

The data that I used was obtained from the public datasets BioModels Database (https://www.ebi.ac.uk/biomodels/). Therefore, the ethics approval did not apply to this study.

Consent for publication

Not applicable.

Competing interests

The author declared that she had no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Xu, J. SBMLKinetics: a tool for annotation-independent classification of reaction kinetics for SBML models. BMC Bioinformatics 24, 248 (2023). https://doi.org/10.1186/s12859-023-05380-3

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12859-023-05380-3

Keywords