The JBEI quantitative metabolic modeling library (jQMM): a python library for modeling microbial metabolism

Birkel, Garrett W.; Ghosh, Amit; Kumar, Vinay S.; Weaver, Daniel; Ando, David; Backman, Tyler W. H.; Arkin, Adam P.; Keasling, Jay D.; Martín, Héctor García

doi:10.1186/s12859-017-1615-y

Software
Open access
Published: 05 April 2017

The JBEI quantitative metabolic modeling library (jQMM): a python library for modeling microbial metabolism

Garrett W. Birkel^1,2,8,
Amit Ghosh^1,2,6,
Vinay S. Kumar^1,2,
Daniel Weaver^1,2,
David Ando^1,2,
Tyler W. H. Backman^1,2,8,
Adam P. Arkin^1,4,5,
Jay D. Keasling^1,2,3,4,7 &
…
Héctor García Martín ORCID: orcid.org/0000-0002-4556-9685^1,2,8,9

BMC Bioinformatics volume 18, Article number: 205 (2017) Cite this article

5389 Accesses
15 Citations
19 Altmetric
Metrics details

An Erratum to this article was published on 18 April 2017

Abstract

Background

Modeling of microbial metabolism is a topic of growing importance in biotechnology. Mathematical modeling helps provide a mechanistic understanding for the studied process, separating the main drivers from the circumstantial ones, bounding the outcomes of experiments and guiding engineering approaches. Among different modeling schemes, the quantification of intracellular metabolic fluxes (i.e. the rate of each reaction in cellular metabolism) is of particular interest for metabolic engineering because it describes how carbon and energy flow throughout the cell. In addition to flux analysis, new methods for the effective use of the ever more readily available and abundant -omics data (i.e. transcriptomics, proteomics and metabolomics) are urgently needed.

Results

The jQMM library presented here provides an open-source, Python-based framework for modeling internal metabolic fluxes and leveraging other -omics data for the scientific study of cellular metabolism and bioengineering purposes. Firstly, it presents a complete toolbox for simultaneously performing two different types of flux analysis that are typically disjoint: Flux Balance Analysis and ¹³C Metabolic Flux Analysis. Moreover, it introduces the capability to use ¹³C labeling experimental data to constrain comprehensive genome-scale models through a technique called two-scale ¹³C Metabolic Flux Analysis (2S-¹³C MFA). In addition, the library includes a demonstration of a method that uses proteomics data to produce actionable insights to increase biofuel production. Finally, the use of the jQMM library is illustrated through the addition of several Jupyter notebook demonstration files that enhance reproducibility and provide the capability to be adapted to the user’s specific needs.

Conclusions

jQMM will facilitate the design and metabolic engineering of organisms for biofuels and other chemicals, as well as investigations of cellular metabolism and leveraging -omics data. As an open source software project, we hope it will attract additions from the community and grow with the rapidly changing field of metabolic engineering.

Background

Metabolism is the set of physical and chemical processes through which living organisms extract energy and mass from their environment in order to sustain life [1]. Hence, an understanding of metabolism is fundamental in order to set the stage for, and understand the physical limitations of, every biological process. For example, metabolic disorders lie at the heart of a variety of human illnesses such as diabetes [2], obesity [3] and cancer [4].

Understanding of metabolism is also of central importance to the microbial production of drugs, bioproducts and biofuels through metabolic engineering [5]. This application of microbial bioengineering to produce biofuels and other commodity products has attracted significant attention due to its potential to reduce the CO₂ emissions which cause climate change, reduce society’s reliance on fossil fuels for energy and chemicals, and provide for enhanced energy security. The production of commodity chemicals from lignocellulosic biomass is critical for achieving these goals, particularly in view of expected future population growth in emergent economies coupled with a marked improvement in their living standards [6].

Metabolic engineering rarely succeeds at the first attempt to introduce and activate a heterologous pathway in a host organism. Rather, most of the time in this process is spent troubleshooting the pathways for acceptable production [7]. Very often, the metabolic engineer diagnoses the problem in the bioengineered microbe by relying on a variety of -omics data (transcriptomics, proteomics, metabolomics) that are expected to provide an explanation of why the system does not behave as expected [8–10]. However, this diagnosing through -omics data is a non trivial process: often the amount of data is much larger than what metabolic engineers are typically trained to analyze, and it is non-trivial to extract actionable items (e.g. overexpress this specific gene or change that promoter) that will reliably improve the system’s behavior [11, 12]. To complicate matters, the reason for the underperformance of a given pathway may not rely on the pathway itself but on other parts of the host metabolism that affect the pathway’s performance in intricate and non-direct ways (e.g. cofactor availability or gene regulation) [13].

Mathematical modeling helps contextualize experimental data into a model that provides an explanation of the observed phenomena and provides predictions for changes in the system under study. Metabolic modeling, in particular, has been successfully used to e.g. increase product yield [14] or identify bottlenecks and competing pathways [15]. Moreover, data analysis and modeling is expected to increase in scope and importance as the creation of increasingly larger sets of -omics data sets are enabled by the rapid drop of DNA sequencing costs [16] and the availability of high-throughput mass spectrometry workflows [17, 18]. Hence, research institutes such as the Joint BioEnergy Institute (JBEI), tasked to produce the scientific and technological underpinnings that will enable lignocellulosic biofuels [19], have devoted research effort not only to develop drop-in biofuels [20] but also tools to speed up bioengineering through mathematical modeling.

Some of the mathematical tools created at JBEI to enable better prediction of bacterial and eukaryotic metabolism are presented here in the form of the JBEI Quantitative Metabolic Modeling library (jQMM). The jQMM library includes algorithms to measure and predict internal metabolic fluxes using three different techniques: ¹³C Metabolic Flux Analysis (¹³C MFA [21]), Flux Balance Analysis (FBA [22]) and two-scale ¹³ C Metabolic Flux Analysis (2S-¹³C MFA [23]). It also includes methods to produce actionable insights from -omics data to improve pathway yield [24], and methodologies for the flux analysis of microbial communities [25]. We hope to add to the jQMM library in the future tools and algorithms currently being developed at JBEI, as well other institutions who may consider contributing to this library.

Methods

Key capabilities

The JBEI QMM library includes code to perform the following techniques: FBA [22], ¹³C MFA [21], 2S-¹³C MFA [23], Principal Component Analysis of Proteomics (PCAP [24]), and ¹³C MFA for microbial communities [25]. Mathematical details for each method are provided in the supplementary material (Additional file 1). FBA, ¹³C MFA and 2S-¹³C MFA are techniques used to calculate internal metabolic fluxes; detailed explanations regarding the differences between each method can be found in Garcia Martin et al. [23]. This library contains sample demonstration scripts to, for example, replicate all the figures from Garcia Martin et al. in the form of Jupyter notebooks (see Table 1). The code in the jQMM library focuses on ¹³C MFA, 2S-¹³C MFA, since there is already an open source python-based library [26] for FBA. For ¹³C MFA there exists some closed-source packages (e.g. METRAN [27], INCA [28], 13CFLUX2 [29]), a few open-source packages (OPENFLUX [30] and OPENFLUX2 [31]) but no open source python-based libraries. For 2S-¹³C MFA, jQMM is the first public library.

Table 1 Table of iPython Notebooks

Full size table

Implementation

jQMM is implemented in Python 2.7, and we have aimed to provide a code base that is as modular as possible, in order to facilitate understanding and reusability. The code is divided into a set of modules that incorporate classes and methods which will typically be used together. Figure 1 details the different modules and how they interact with each other for flux-based analysis. The following subsections provide a brief explanation of the purpose of each module and Figs. 2, 3 and 4 (and Additional file 1: Figures S1-S3) provide graphical depictions of each module together with their main classes and methods. Tutorials are provided for each module, along with unit tests.

In order to facilitate reproducibility, the lack of which represents one of the main problems afflicting biological research [32], all results and tutorials are provided in the form of Jupyter notebooks [33, 34] (see Table 1 and jupyter.org). This approach is meant to facilitate the reproduction of published results and the creation of new results based on these. The Jupyter notebooks are divided into two types: type A demonstrates the library’s capabilities and type B reproduces published results using code from the jQMM library (Table 1). To further enhance reproducibility we have provided a docker container in the github repository that includes all dependencies. Docker (http://www.docker.com) containers wrap a piece of software in a complete filesystem containing everything needed to run: code, systems libraries, systems tools and anything else that can be installed in a server. This guarantees that it will always run correctly and in the same way, regardless of the system environment it is running in. The jQMM docker container can be run on any computer and any cloud computing service such as Amazon Web Services (AWS), Google Cloud Platform or Microsoft Azure. The container does not include GAMS, CPLEX or CONOPT licences, which must be provided by the user.

Modules

This section contains a succint explanation for each module and some of its classes and capabilities. A more detailed description of each class containing all possible functionality can be found in the Jupyter notebooks (Table 1) and the jQMM code on the github website (see availability below).

Core

The core module contains the classes that constitute the base for all other functionality (Fig. 2). These basic classes include the Reaction and Metabolite classes, along with the derived Product and Reactant classes. These classes are defined as required to describe labeling experiments and, as such, are different from equivalently named classes in COBRApy [26] (defined for FBA purposes). However, we expect in the future to derive these classes from the corresponding COBRApy classes. Other basic classes include the flux class, which is used to describe fluxes for a given reaction (See Jupyter notebook A1).

Core classes that describe concepts exclusively used in ¹³ C labeling experiments include the atomTransition, emu and emuTransition classes. The atomTransition class is used to hold carbon transition rules (information on the fate for each carbon atom in a reaction [23]), e.g.:

PDH: pyr –> co2 + accoa ; abc –> a + bc

Note that this is different from the Reaction class, which would only hold information on the reaction name, products and reactants for a reaction:

PDH: coa[c] + nad[c] + pyr[c] –> co2[c] + accoa[c] + nadh[c]

By using two classes to differentiate transitions and reactions, we minimize confusion between two related but different concepts. The emu class describes elementary metabolite units (emus, [35]), which are used to efficiently calculate labeling patterns corresponding to a flux profile. Emus represent moieties comprising any distinct subset of a compound’s atoms: for example, if pyruvate consists of 3 carbons, an emu is a subset of any number of these 3 carbons: e.g. pyr_1_2, pyr_1_2_3 or pyr_2_3 are emus. The EMUTransition class describes how emus transition into each other for each reaction, for example for citrate synthase:

CS, accoa_c_1_2 + oac_c_2 –> cit_c_3_4_5indicates that emus accoa_c_1_2 and oac_c_2 condense into emu cit_c_3_4_5. This initial decomposition of atom transitions into emus and emu transitions is fundamental to solving the forward problem (i.e. determine labeling from fluxes) in ¹³C MFA, and is provided as part of the core module, as shown in Jupyter notebook A2.

A final core class is the rangedNumber class, which describes floating point numbers with a lower and upper limit, reflecting confidence intervals. These type of numbers are used, for example, to hold information for a flux obtained from 2S- ¹³ C MFA, where we have a best fit value (0.5), plus upper (0.6) and lower limits (0.3) which represent the upper and lower flux values compatible with the experimental data (see Equation 23 in [23]):

PDH: [ 0.3 : 0.5 : 0.6 ]

ReactionNetworks

The ReactionNetworks module (see Additional file 1: Figure S1) contains the classes needed to store three types of reaction networks: stoichiometric (reactionNetwork, used for FBA), carbon transitions (C13reactionNetwork, for ¹³C MFA) and stochiometric with carbon transitions only for a defined core (TSreactionNetwork, for 2S-¹³C MFA). The first class contains the methods for manipulating purely stoichiometric reaction networks, such as those used in FBA, the second one contains the methods for dealing with carbon transition networks typically used in ¹³C MFA and the final class contains the methods used to manipulate networks where genome-scale stoichiometric information is mixed with carbon transitions information (used in 2S-¹³C MFA). Examples of these types of networks along with tutorials on how to use these classes to manipulate them can be found in Jupyter notebook A3.

FluxModels

The FluxModels module (Fig. 3, and Additional file 1: Figure S2) contains the classes used to solve each of the various types of flux problems solved in this library: FBA (FBAModel), ¹³C MFA (C13Model), 2S-¹³C MFA (TwoSC13Model) and ELVA (External Labeling Variability Analysis, ELVAModel, a subproblem of 2S-¹³C MFA, [23]). Flux Variability Analysis (FVA [36]) and ¹³C Flux Variability Analysis (¹³C FVA [23]) can be solved from the same model as for FBA and ¹³C MFA respectively, since they require the same information. Aditionally, classes for the results obtained from each of these types of models are included, which are used to plot, analyze and manipulate results (FBAResults, FVAResults, C13Results, TSResults, ELVAResults, C13FVAResults and TSFVAResults). More details can be found in Jupyter notebook A4.

EnhancedLists

The enhancedLists module (Additional file 1: Figure S3) holds lists that have been enhanced with methods useful for the objects contained. In this way, there is e.g. a ReactionList class that holds reactions and e.g. has a carbonTransitionsOK() method that tests if carbon transition and reaction information are compatible (same reactants, same products, etc). The types of lists included are ReactionList, MetaboliteList, EMUList, EMUTransitionList and AtomTransitionList. Their use is demonstrated in Jupyter notebook A2.

GAMSclasses

The GAMSclasses module contains the classes used to solve the optimization problems for each model (see below) through the GAMS modeling system (GAMS Development Corporation), either in series or in parallel (Fig. 4). As depicted in Fig. 1, all flux techniques require solving an optimization problem using a different type of solver (either CPLEX or CONOPT in GAMS), which are accessed through the GAMS framework. At this moment, this interaction is encoded in this class, but we expect to replace it soon by use of the GAMS python API. More information and tests can be found in Jupyter notebook A5.

Predictions

This module provides the classes needed to make flux predictions using the Minimization of Metabolic Adjustment (MoMA [36]) and Regulatory on/off minimization (ROOM [37]) methods that were used in the original 2S-¹³C MFA paper [23]. A demonstration can be found in Jupyter notebook A6.

Labeling

The Labeling module contains all classes needed to analyze, manipulate and plot labeling patterns (a.k.a Mass Distribution Vector, MDV, the fraction of molecules with n =1, 2, 3... labeled carbons) and fragments. The base class labelingData is used to derive LCMSLabelData, CEMSLabelData and GCMSLabelData, which are classes for each type of mass spectroscopy data. The fragment class contains all information for the MS fragment harboring the MDV and is used to derive classes GCMSfragment, LCMSfragment and CEMSfragment. More details can be found in Jupyter notebook A7.

DB

The DB module houses a database of all hard-coded information, conveniently located in a single module. This includes, for example, a dictionary of fragments with all their detailed information: name, abbreviation, formula, number of carbons, formula without carbon backbone, etc.

Utilities

The utilities module stores small functions that are useful in a variety of modules: e.g. a function to extract the atoms numbers from an elemental composition string (^′ H6NO2Si ^′⇒ [ (^′ H ^′,6),(^′ N ^′,1),(^′ O ^′,2),(^′ Si ^′,1)]).

Results and discussion

We will now demonstrate the use of the jQMM library for each of the techniques included in the package using data from the literature.

¹³ C Metabolic Flux Analysis for TCA toy model

The first demonstration (Fig. 5) involves the toy model of the TCA cycle originally used by Antoniewicz et al. [35] to showcase the Elementary Metabolite Unit (EMU) method (Fig. 12 in [35]). The toy model is a simplification of the tricarboxylic acid (TCA) cycle that still retains its main features. Acetyl coenzyme A (AcCoA) and aspartate are the two substrates, and glutamate and carbon dioxide are the products. The labeling for glutamate is calculated assuming a mixture of 25% [2-¹³C]AcCoA and 25% [1,2-¹³C]AcCoA as the tracer input (with everything else unlabeled), while aspartate is considered unlabeled. Fluxes are fixed to predetermined values, since the goal in this case is to showcase the calculation of labeling patterns from fluxes. Jupyter Notebook B2 demonstrates how, using jQMM, EMU transitions are generated, and solved through GAMS and CONOPT without the need to decompose the network into decoupled EMU reaction networks. The computed labeling pattern is the same as obtained through 13CFLUX2 [29](see Fig. 5, and Jupyter Notebook B2 for more details).

¹³ C Metabolic Flux Analysis for E. coli data

The next demonstration of the jQMM library involves a traditional use of ¹³ C MFA to calculate fluxes using the data from Toya et al. [38], in order to reproduce supplementary figure S18 from Garcia Martin et al. [23]. Fluxes are calculated for wild type E. coli fitting labeling data for 9 different intracellular metabolites. This calculation uses the original small-scale model for central carbon metabolism that lumps several reactions together and displays only an approximate account of fluxes to biomass. For example, as discussed in Garcia Martin et al. [23], the large fluxes draining acetyl-CoA into biomass and to the exterior of the cell as acetate (each a third of the pyruvate dehydrogenase flux, PDH) imposed in the original publication have been overestimated. The typical biomass function used in these minimal ¹³ C MFA models with reactions lumped together involves a specific stoichiometry of central carbon intermediates converted to biomass. However, this represents an approximation since some of the metabolites required for biomass growth are not present in the minimal network model and need to be substituted by their requirements in terms of intermediates considered in the minimal network (acetyl-CoA, in this case). These effects require significant effort to be accurately incorporated into a small-scale model, but they are properly handled by the genome-scale model, as demonstrated in the next example. Jupyter Notebook B3 shows how to calculate fluxes through ¹³ C MFA for this case.

2S- ¹³ C Metabolic Flux Analysis for E. coli data

This example calculates fluxes for a full comprehensive genome-scale model [23] constrained by ¹³C labeling data through the 2S-¹³C MFA algorithm (Fig. 6). The input is the same as for the ¹³C MFA case in the previous section, with the addition of the iJR904 genome-scale model [39]. As discussed in depth in Garcia Martin et al. [23], the solution displays nearly the same values as ¹³C MFA for central metabolism (see e.g. Figure S4 and S18 in [23]). This similarity is expected since 2S-¹³C MFA is designed to mimic ¹³C MFA for this part of metabolism (see “Limiting flux to core reactions” section in [23]). The only difference for the Toya et al. data set can be found in the TCA cycle flux. These differences appear because genome-scale models account for fluxes to biomass in a more realistic manner and because they do not rule out unexpected metabolic routes compatible with the available data.

Furthermore, the use of genome-scale models allows for some information that can not be obtained from smaller, non-comprehensive models, such as an account of all reactions producing and consuming e.g. NADPH as constrained by ¹³C labeling data (Fig. 6 in [23]). Other advantages of using genome-scale models include the prediction of non measured fluxes (Fig. S13 in [23]) and quantitative full predictions of directly measurable data (Fig. 10 in [23]). Jupyter Notebook B4 shows how to reproduce all figures from Garcia Martin et al. [23].

¹³ C Metabolic Flux Analysis for a soil microbial community

In order to show the general capabilities of the jQMM library for non-standard ¹³C MFA we will show how one can use this library to calculate fluxes for microbial communities as shown in Hagerty et al. [25]. The problem presented and solved in this paper is related to ¹³C MFA but, at the same time, the technical details differ notably. In this and other experiments [40, 41], Dijkstra et al. collected several soil samples and incubated them with two different carbon sources (glucose and pyruvate), where each source was chosen to have a different initial labeling (1-¹³C or fully labeled for glucose and 1-¹³C or 2,3-¹³C for pyruvate). For each carbon source and labeling type, the labeling fraction of CO₂ emmited from the soil was measured. For each carbon source, the two relative CO₂ labeling profiles provided enough information to fit a small model of carbon metabolism that represented the combination of reactions for all microbial entities in the soil. The good fits supported the assumption that all members of the community display a similar metabolic flux state, and the results are used to derive the carbon utilization efficiency (i.e. how much carbon is diverted into biomass and how much is lost as CO₂, a critical parameter for climate models [42]). Jupyter notebook B6 shows how to do these calculations using modular combinations of the jQMM library without recourse to complicated excel data sheets [25].

Flux balance analysis

The jQMM library provides the means to perform FBA (Jupyter notebook B1) and other COBRA methods for the sake of comparison with ¹³C-based methods [23]. However, the main purpose of jQMM is not to provide a COBRA library, since actively developed open source python libraries such as COBRApy [26] and cameo (http://cameo.bio/) are available for this purpose. Methods for FBA, FVA, Minimization of Metabolic Adjustment (MoMA [36]) and Regulatory on/off minimization (ROOM [37]) are provided. In order to showcase these capabilities we replicated part of the tutorial from the COBRA v 1.0 package in Becker et al. [43], which can be found in Jupyter notebook B1. Future versions of the library will likely be changed to use cameo for this functionality.

Principal Component Analysis of Proteomics (PCAP)

While most of the jQMM library is focused on flux analysis, we have found it useful to statistically model -omics data without the recourse to mechanistic models [24], an approach which we expect to be increasingly popular in the coming years given the increasing availability of -omics data. In Jupyter notebook B5, we show how to use the jQMM library and quantitative proteomics data to increase biofuel yields by replicating the PCAP (Principal Component Analysis for Proteomics) analysis with a recreation of previously published figures [24] (Fig. 7).

This case represents a prototypical example of pathway engineering, in which a series of exogenous genes from a variety of organisms (E. coli, S. cerevisiae, S. aureus, and plants such as M. spicata) were assembled together in a pathway able to produce limonene [44], a jet-fuel [45]. However, when assembling this pathway together, there was no easy way to figure out the design parameters for optimal function: what promoter strength to use, when to induce? how much? In this case, a combinatorial approach was used, using promoters of different strength (weak, medium, strong), different induction strengths (low, medium, high) and induction times (low, medium, high). For each of these scenarios, limonene production and protein expression for all genes in the pathway were measured. The raw proteomics data did not provide clear trends but the use of a basic data analysis tool such as a Principal Component Analysis (PCA) produced actionable results which allowed for the improvement of limonene production by 40% and bisabolene production by 200% leading to the highest producing strain of this biodiesel replacement to that date [24]. Jupyter notebook B5 shows how to perform this analysis.

Conclusion

We have presented here the jQMM library, an open-source modular library for modeling metabolism and the first open-source library for ¹³C MFA in Python. The jQMM library is also the first one to provide a method to calculate fluxes for genome-scale models constrained by ¹³C labeling data through 2S-¹³C MFA, and is the first to unify the FBA, ¹³C MFA and 2S-¹³C MFA in a single package. Unlike METRAN [27], INCA [28] and 13CFLUX2 [29]), it is open source and, unlike OPENFLUX [30] and OPENFLUX2 [31], it is written in Python. Furthermore, it includes an example on how to calculate fluxes for microbial communities through ¹³C MFA, and the tools to leverage other types of -omics data (e.g. proteomics for biofuel production).

While the examples here are mainly focused on various types of flux analysis (FBA, ¹³C MFA, 2S-¹³C MFA, etc) and biofuels, the tools are of general aplicability to other fields (e.g. pharma, biomedicine, environmental microbiology) and include statistical methods as well (e.g. PCAP). jQMM presents not only methods for modeling and generating actionable items form -omics data, but also libraries for data manipulation and visualization (fluxes and labeling patterns see Figs. 6 and 5), including classes for most common concepts in metabolism (e.g. reactions, metabolites, GC-MS fragments, reaction networks) in order to facilitate their manipulation, and which are of utility beyond the examples presented here. Every module in jQMM is explained in detail in Jupyter notebooks that allow for immediate reproducibility. Python unit tests are included for all modules for ease of testing when adding functionality.

jQMM will continue growing as the methods developed and used at JBEI for biofuel production expand and mature, while other authors contribute to it. At the moment jQMM is heavily dependent on the closed-source commercial platform GAMS and the CPLEX and CONOPT solvers. Our first future improvement to the jQMM library is planned to involve providing an interface to open-source and free version of solvers equivalent to GAMS. Further future improvements will likely include integration with COBRApy [26], cameo (cameo.bio) or KBase (kbase.us).

Availability of data and materials

Project name: jQMM library

Project home page: https://github.com/JBEI/jqmm

Operating systems: Platform independent

Programming language: Python, GAMS

Other requirements: CONOPT large scale nonlinear solver

All data used in this paper can be found in the github library itself. Additional file 1 contains a mathematical description of the algorithms, and Figures S1-S3.

Abbreviations

¹³C MFA:: ¹³C Metabolic Flux Analysis
2S-¹³C MFA:: Two-scale ¹³C Metabolic Flux Analysis
AcCoa:: Acetyl-CoA
API:: Application Program Interface
COBRA:: COnstraint Based Reconstruction and Analysis
ELVA:: External Labeling Variability Analysis
EMU:: Elementary Metabolite Unit
FBA:: Flux Balance Analysis
FVA:: Flux Variability Analysis
JBEI:: Joint BioEnergy Institute
jQMM:: JBEI Quantitative Metabolic Modeling
MDV:: Mass Distribution Vector
MoMA:: Minimum of Metabolic Adjustment
NADPH:: Nicotinamide adenine dinucleotide phosphate
PDH:: Pyruvate Dehydrogenase
PCAP:: Principal Component Analysis of Proteomics
ROOM:: Regulatory On/Off Minimization
TCA:: trycarboxylic acid

References

Nelson DL, Lehninger AL, Cox MM. Lehninger Principles of Biochemistry. New York: W. H. Freeman and Company; 2008.
Google Scholar
Xu F, Tavintharan S, Sum CF, Woon K, Lim SC, Ong CN. Metabolic signature shift in type 2 diabetes mellitus revealed by mass spectrometry-based metabolomics. J Clin Endocrinol Metab. 2013; 98(6). doi:10.1210/jc.2012-4132.
Cummins TD, Holden CR, Sansbury BE, Gibb AA, Shah J, Zafar N, Tang Y, Hellmann J, Rai SN, Spite M, Bhatnagar A, Hill BG. Metabolic remodeling of white adipose tissue in obesity,. Am J Physiol Endocrinol Metab. 2014; 307(3):262–77. doi:10.1152/ajpendo.00271.2013.
Article Google Scholar
Wu L, Gomes A, Sinclair D. Geroncogenesis: Metabolic changes during aging as a driver of tumorigenesis. 2014. NIHMS150003. doi:10.1016/j.ccr.2013.12.005.
Chubukov V, Mukhopadhyay A, Petzold CJ, Keasling JD, Martín HG. Synthetic and systems biology for microbial production of commodity chemicals. npj Syst Biol Appl. 2016; 2:16009. doi:10.1038/npjsba.2016.9.
Article Google Scholar
Lorek S, Spangenberg JH. Sustainable consumption within a sustainable economy – beyond green growth and green economies. J Clean Prod. 2014; 63:33–44. doi:10.1016/j.jclepro.2013.08.045.
Article Google Scholar
Nielsen J, Keasling JD. Engineering cellular metabolism. Cell. 2016; 164(6):1185–97.
Article CAS PubMed Google Scholar
Lee SY, Lee DY, Kim TY. Systems biotechnology for strain improvement. Trends Biotechnol. 2005; 23(7):349–58.
Article CAS PubMed Google Scholar
Han MJ, Yoon SS, Lee SY. Proteome analysis of metabolically engineeredescherichia coli producing poly (3-hydroxybutyrate). J Bacteriol. 2001; 183(1):301–8.
Article CAS PubMed PubMed Central Google Scholar
George KW, Chen A, Jain A, Batth TS, Baidoo EE, Wang G, Adams PD, Petzold CJ, Keasling JD, Lee TS. Correlation analysis of targeted proteins and metabolites to assess and engineer microbial isopentenol production. Biotech Bioeng. 2014; 111(8):1648–58.
Article CAS Google Scholar
Palsson B, Zengler K. The challenges of integrating multi-omic data sets. Nat Chem Biol. 2010; 6(11):787–9.
Article PubMed Google Scholar
Nielsen J, Fussenegger M, Keasling J, Lee SY, Liao JC, Prather K, Palsson B. Engineering synergy in biotechnology. Nat Chem Biol. 2014; 10(5):319–22.
Article CAS PubMed Google Scholar
Javidpour P, Pereira JH, Goh EB, McAndrew RP, Ma SM, Friedland GD, Keasling JD, Chhabra SR, Adams PD, Beller HR. Biochemical and structural studies of nadh-dependent fabg used to increase the bacterial production of fatty acids under anaerobic conditions. Appl Environ Microbiol. 2014; 80(2):497–505.
Article PubMed PubMed Central Google Scholar
Bhan N, Xu P, Khalidi O, Koffas MA. Redirecting carbon flux into malonyl-coa to improve resveratrol titers: proof of concept for genetic interventions predicted by optforce computational framework. Chem Eng Sci. 2013; 103:109–14.
Article CAS Google Scholar
Lee KH, Park JH, Kim TY, Kim HU, Lee SY. Systems metabolic engineering of escherichia coli for l-threonine production. Mol Syst Biol. 2007; 3(1):149.
CAS PubMed PubMed Central Google Scholar
Wetterstrand KA.DNA Sequencing Costs: Data from the NHGRI Genome Sequencing Program (GSP). 2016. http://www.genome.gov/sequencingcosts. Accessed 05 Feb 2016.
Batth TS, Singh P, Ramakrishnan VR, Sousa MML, Chan LJG, Tran HM, Luning EG, Pan EHY, Vuu KM, Keasling JD, Adams PD, Petzold CJ. A targeted proteomics toolkit for high-throughput absolute quantification of Escherichia coli proteins. Metab Eng. 2014; 26C:48–56.
Article Google Scholar
Fuhrer T, Zamboni N. High-throughput discovery metabolomics. Curr Opin Biotechnol. 2015; 31:73–8.
Article CAS PubMed Google Scholar
Blanch HW, Adams PD, Andrews-Cramer KM, Frommer WB, Simmons BA, Keasling JD. Addressing the need for alternative transportation fuels: the Joint BioEnergy Institute. ACS Chem Biol. 2008; 3(1):17–20.
Article CAS PubMed Google Scholar
Beller HR, Lee TS, Katz L. Natural products as biofuels and bio-based chemicals: fatty acids and isoprenoids. Nat Prod Rep. 2015; 32.10:1508–26.
Article Google Scholar
Wiechert W. 13C metabolic flux analysis. Metab Eng. 2001; 3(3):195–206.
Article CAS PubMed Google Scholar
Lewis NE, Nagarajan H, Palsson BO. Constraining the metabolic genotype-phenotype relationship using a phylogeny of in silico methods. Nat Rev Microbiol. 2012; 10(4):291–305.
CAS PubMed PubMed Central Google Scholar
Garcia Martin H, Kumar VS, Weaver D, Ghosh A, Chubukov V, Mukhopadhyay A, et al. A Method to Constrain Genome-Scale Models with 13C Labeling Data. PLoS Comput Biol. 2015; 11(9):e1004363. doi:10.1371/journal.pcbi.1004363.
Article PubMed Central Google Scholar
Alonso-Gutierrez J, Kim EM, Batth TS, Cho N, Hu Q, Chan LJG, Petzold CJ, Hillson NJ, Adams PD, Keasling JD, Garcia-Martin H, Soon Lee T. Principal component analysis of proteomics (PCAP) as a tool to direct metabolic engineering. Metab Eng. 2014; 28:123–33.
Article PubMed Google Scholar
Hagerty SB, van Groenigen KJ, Allison SD, Hungate BA, Schwartz E, Koch GW, Kolka RK, Dijkstra P. Accelerated microbial turnover but constant growth efficiency with warming in soil. Nat Clim Chang. 2014; 4(10):903–6.
Article CAS Google Scholar
Ebrahim A, Lerman JA, Palsson BO, Hyduke DR. COBRApy: COnstraints-Based Reconstruction and Analysis for Python. BMC Syst Biol. 2013; 7(1):74.
Article PubMed PubMed Central Google Scholar
Crown SB, Ahn WS, Antoniewicz MR. Rational design of 13C-labeling experiments for metabolic flux analysis in mammalian cells. BMC Syst Biol. 2012; 6(1):43.
Article CAS PubMed PubMed Central Google Scholar
Young JD. INCA: a computational platform for isotopically non-stationary metabolic flux analysis. Bioinforma (Oxford, England). 2014; 30(9):1333–5.
Article CAS Google Scholar
Weitzel M, et al. 13CFLUX2Ůhigh-performance software suite for 13C-metabolic flux analysis. Bioinformatics. 2013; 29.1:143–5.
Article Google Scholar
Quek LE, Wittmann C, Nielsen LK, Krömer JO. OpenFLUX: efficient modelling software for 13C-based metabolic flux analysis. Microb Cell Factories. 2009; 8(1):25.
Article Google Scholar
Shupletsov MS, Golubeva LI, Rubina SS, Podvyaznikov DA, Iwatani S, Mashko SV. OpenFLUX2: 13 C-MFA modeling software package adjusted for the comprehensive analysis of single and parallel labeling experiments. Microb Cell Factories. 2014; 13(1):152.
Google Scholar
Plant AL, Locascio LE, May WE, Gallagher PD. Improved reproducibility by assuring confidence in measurements in biomedical research. Nat Methods. 2014; 11(9):895–8.
Article CAS PubMed Google Scholar
Wilson G, Aruliah DA, Brown CT, Chue Hong NP, Davis M, Guy RT, Haddock SHD, Huff KD, Mitchell IM, Plumbley MD, Waugh B, White EP, Wilson P. Best Practices for Scientific Computing. PLoS Bio. 2014; 12(1). arXiv:1210.0530v3.
Pérez F, Granger BE. IPython: a system for interactive scientific computing. Comput Sci Eng. 2007; 9(3):21–9.
Article Google Scholar
Antoniewicz MR, Kelleher JK, Stephanopoulos G. Elementary metabolite units (EMU): A novel framework for modeling isotopic distributions. Metab Eng. 2007; 9(1):68–86. NIHMS150003.
Article CAS PubMed Google Scholar
Segrè D, Vitkup D, Church GM. Analysis of optimality in natural and perturbed metabolic networks. Proc Natl Acad Sci U S A. 2002; 99(23):15112–7.
Article PubMed PubMed Central Google Scholar
Shlomi T, Berkman O, Ruppin E. Regulatory on/off minimization of metabolic flux changes after genetic perturbations,. Proc Natl Acad Sci U S A. 2005; 102(21):7695–00.
Article CAS PubMed PubMed Central Google Scholar
Toya Y, Ishii N, Nakahigashi K, Hirasawa T, Soga T, Tomita M, Shimizu K. 13C-Metabolic flux analysis for batch culture of Escherichia coli and its pyk and pgi gene knockout mutants based on mass isotopomer distribution of intracellular metabolites. Biotechnol Prog. 2010; 26(4):975–92.
CAS PubMed Google Scholar
Reed JL, Vo TD, Schilling CH, Palsson BO. An expanded genome-scale model of Escherichia coli K-12 (iJR904 GSM/GPR). Genome Biol. 2003; 4(9):54.
Article Google Scholar
Dijkstra P, Dalder JJ, Selmants PC, Hart SC, Koch GW, Schwartz E, Hungate BA. Modeling soil metabolic processes using isotopologue pairs of position-specific 13C-labeled glucose and pyruvate. Soil Biol Biochem. 2011; 43(9):1848–57.
Article CAS Google Scholar
Dijkstra P, Thomas SC, Heinrich PL, Koch GW, Schwartz E, Hungate BA. Effect of temperature on metabolic activity of intact microbial communities: Evidence for altered metabolic pathway activity but not for increased maintenance respiration and reduced carbon use efficiency. Soil Biol Biochem. 2011; 43(10):2023–31.
Article CAS Google Scholar
Cox PM, Betts RA, Jones CD, Spall SA, Totterdell IJ. Acceleration of global warming due to carbon-cycle feedbacks in a coupled climate model. Nature. 2000; 408(6809):184–7.
Article CAS PubMed Google Scholar
Becker SA, Feist AM, Mo ML, Hannum G, Palsson BO, Herrgard MJ. Quantitative prediction of cellular metabolism with constraint-based models: the COBRA Toolbox. Nat Protoc. 2007; 2(3):727–38.
Article CAS PubMed Google Scholar
Alonso-Gutierrez J, Chan R, Batth TS, Adams PD, Keasling JD, Petzold CJ, Lee TS. Metabolic engineering of Escherichia coli for limonene and perillyl alcohol production. Metab Eng. 2013; 19:33–41.
Article CAS PubMed Google Scholar
Chuck CJ, Donnelly J. The compatibility of potential bioderived fuels with Jet A-1 aviation kerosene. Appl Energy. 2014; 118:83–91.
Article CAS Google Scholar

Download references

Acknowledgements

Not applicable.

Funding

This work was part of the DOE Joint BioEnergy Institute (http://www.jbei.org) supported by the U. S. Department of Energy, Office of Science, Office of Biological and Environmental Research, and was part of the Agile BioFoundry (http://agilebiofoundry.org) supported by the U.S. Department of Energy, Energy Efficiency and Renewable Energy, Bioenergy Technologies Office, through contract DE-AC02-05CH11231 between Lawrence Berkeley National Laboratory and the U. S. Department of Energy. The United States Government retains and the publisher, by accepting the article for publication, acknowledges that the United States Government retains a non-exclusive, paid-up, irrevocable, world-wide license to publish or reproduce the published form of this manuscript, or allow others to do so, for United States Government purposes. This research is also supported by the Basque Government through the BERC 2014-2017 program and by Spanish Ministry of Economy and Competitiveness MINECO: BCAM Severo Ochoa excellence accreditation SEV-2013-0323.

Authors’ contributions

HGM, GWB, AG, VSK, DW and DA wrote the code and analyzed the data. HGM, GWB, DA, AA and JDK wrote the paper. TWHB provided the docker containers. All authors have read and approve of the final version of the manuscript.

Competing interests

JDK has financial interests in Amyris and Lygos. All other authors have no competing interests to declare.

Consent for publication

Not applicable.

Ethics approval and consent to participate

Not applicable.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Author information

Authors and Affiliations

Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
Garrett W. Birkel, Amit Ghosh, Vinay S. Kumar, Daniel Weaver, David Ando, Tyler W. H. Backman, Adam P. Arkin, Jay D. Keasling & Héctor García Martín
Joint BioEnergy Institute, Emeryville, CA, USA
Garrett W. Birkel, Amit Ghosh, Vinay S. Kumar, Daniel Weaver, David Ando, Tyler W. H. Backman, Jay D. Keasling & Héctor García Martín
Department of Chemical and Biomolecular Engineering, University of California, Berkeley, CA, USA
Jay D. Keasling
Department of Bioengineering, University of California, Berkeley, CA, USA
Adam P. Arkin & Jay D. Keasling
Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
Adam P. Arkin
School of Energy Science and Engineering, Indian Institute of Technology (IIT), Kharagpur, India
Amit Ghosh
Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Hørsholm, DK2970, Denmark
Jay D. Keasling
DOE Agile BioFoundry, Emeryville, CA, USA
Garrett W. Birkel, Tyler W. H. Backman & Héctor García Martín
BCAM, Basque Center for Applied Mathematics, Bilbao, Spain
Héctor García Martín

Authors

Garrett W. Birkel
View author publications
You can also search for this author in PubMed Google Scholar
Amit Ghosh
View author publications
You can also search for this author in PubMed Google Scholar
Vinay S. Kumar
View author publications
You can also search for this author in PubMed Google Scholar
Daniel Weaver
View author publications
You can also search for this author in PubMed Google Scholar
David Ando
View author publications
You can also search for this author in PubMed Google Scholar
Tyler W. H. Backman
View author publications
You can also search for this author in PubMed Google Scholar
Adam P. Arkin
View author publications
You can also search for this author in PubMed Google Scholar
Jay D. Keasling
View author publications
You can also search for this author in PubMed Google Scholar
Héctor García Martín
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Héctor García Martín.

Additional information

The original version of this article was revised to correct fig.7

An erratum to this article is available at http://dx.doi.org/10.1186/s12859-017-1631-y.

Additional file

Additional file 1

Supplementary file containing the mathematical description of algorithms, and supplementary figures S1-S3. (PDF 2250 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Cite this article

Birkel, G.W., Ghosh, A., Kumar, V.S. et al. The JBEI quantitative metabolic modeling library (jQMM): a python library for modeling microbial metabolism. BMC Bioinformatics 18, 205 (2017). https://doi.org/10.1186/s12859-017-1615-y

Download citation

Received: 03 November 2016
Accepted: 25 March 2017
Published: 05 April 2017
DOI: https://doi.org/10.1186/s12859-017-1615-y

The JBEI quantitative metabolic modeling library (jQMM): a python library for modeling microbial metabolism

Abstract

Background

Results

Conclusions

Background

Methods

Key capabilities

Implementation

Modules

Core

ReactionNetworks

FluxModels

EnhancedLists

GAMSclasses

Predictions

Labeling

DB

Utilities

Results and discussion

13 C Metabolic Flux Analysis for TCA toy model

13 C Metabolic Flux Analysis for E. coli data

2S- 13 C Metabolic Flux Analysis for E. coli data

13 C Metabolic Flux Analysis for a soil microbial community

Flux balance analysis

Principal Component Analysis of Proteomics (PCAP)

Conclusion

Availability of data and materials

Abbreviations

References

Acknowledgements

Funding

Authors’ contributions

Competing interests

Consent for publication

Ethics approval and consent to participate

Publisher’s Note

Author information

Authors and Affiliations

Corresponding author

Additional information

Additional file

Additional file 1

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Bioinformatics

Contact us

¹³ C Metabolic Flux Analysis for TCA toy model

¹³ C Metabolic Flux Analysis for E. coli data

2S- ¹³ C Metabolic Flux Analysis for E. coli data

¹³ C Metabolic Flux Analysis for a soil microbial community