WUFlux: an open-source platform for 13C metabolic flux analysis of bacterial metabolism

He, Lian; Wu, Stephen G.; Zhang, Muhan; Chen, Yixin; Tang, Yinjie J.

doi:10.1186/s12859-016-1314-0

Software
Open access
Published: 04 November 2016

BMC Bioinformatics volume 17, Article number: 444 (2016)

Abstract

Background

Flux analyses, including flux balance analysis (FBA) and ¹³C-metabolic flux analysis (¹³C-MFA), offer direct insights into cell metabolism, and have been widely used to characterize model and non-model microbial species. Nonetheless, constructing the ¹³C-MFA model and performing flux calculation are demanding for new learners, because they require knowledge of metabolic networks, carbon transitions, and computer programming. To facilitate and standardize the ¹³C-MFA modeling work, we set out to publish a user-friendly and programming-free platform (WUFlux) for flux calculations in MATLAB®.

Results

We constructed an open-source platform for steady-state ¹³C-MFA. Using GUIDE (graphical user interface design environment) in MATLAB, we built a user interface that allows users to modify models based on their own experimental conditions. WUFlux is capable of directly correcting mass spectrum data of TBDMS (N-tert-butyldimethylsilyl-N-methyltrifluoroacetamide)-derivatized proteinogenic amino acids by removing background noise. To simplify ¹³C-MFA of different prokaryotic species, the software provides several metabolic network templates, including those for chemoheterotrophic bacteria and mixotrophic cyanobacteria. Users can modify the network and constraints, and then analyze the microbial carbon and energy metabolisms of various carbon substrates (e.g., glucose, pyruvate/lactate, acetate, xylose, and glycerol). WUFlux also offers several ways of visualizing the flux results with respect to the constructed network. To validate our model’s applicability, we have compared and discussed the flux results obtained from WUFlux and other MFA software. We have also illustrated how model constraints of cofactor and ATP balances influence fluxome results.

Conclusion

Open-source software for ¹³C-MFA, WUFlux, with a user-friendly interface and easy-to-modify templates, is now available at http://www.13cmfa.org/ or http://tang.eece.wustl.edu/ToolDevelopment.htm. We will continue documenting curated models of non-model microbial species and improving WUFlux performance.

Background

Metabolic flux analyses, including flux balance analysis (FBA) and ¹³C metabolic flux analysis (MFA), are widely used to predict or measure in vivo enzyme reaction rates in microbes. FBA can unravel microbial metabolism based on the stoichiometry of the metabolic reactions as well as measurements of the inflow (substrate uptake) and outflow fluxes (biomass and product synthesis). To facilitate the development of genome scale models, much software has been developed [1]. Our research group built a web-based platform named MicrobesFlux (http://www.microbesflux.org/) [2]. This platform can automatically draft a metabolic model from the annotated microbial genome in the KEGG database. Based on users’ feedback, we have re-built our system on a commercial server to improve its functionality, stability, and robustness. The new MicrobesFlux has been updated with both AMPL optimization software and metabolic network information from the latest version of the KEGG database. This platform now includes 3192 species compared to 1304 species in the previous version. Nevertheless, the MicrobesFlux platform still performs only FBA to estimate the flux values. A more rigorous flux analysis requires ¹³C-MFA, which combines FBA with ¹³C isotopic tracing. To complement the current platform, we sought to build an open-source MATLAB-based package (WUFlux) for metabolic flux analysis.

¹³C-MFA requires both experimental and modeling efforts (Fig. 1). ¹³C-labeling experiments consist of feeding the cell culture with defined ¹³C-substrates to fingerprint downstream metabolites with ¹³C-carbons. Once ¹³C has reached a steady state distribution throughout the metabolic network, the labeling patterns of proteinogenic amino acids or free metabolites can be used by a ¹³C-MFA model to decipher the intracellular flux distributions. ¹³C-MFA can help researchers discover novel pathways, resolve reversible and branched fluxes, and quantify circular metabolic routes (e.g., the tricarboxylic acid cycle or TCA cycle). However, ¹³C-MFA is challenging. In terms of experiments, conventional ¹³C-MFA requires that the cell cultures grow in a defined medium and under steady state conditions. The researchers need to select proper ¹³C tracers and obtain high-quality isotopomer data for flux analysis. Meanwhile, construction of the ¹³C-MFA model and flux calculation are demanding for new learners, because they require not only knowledge of metabolic networks and carbon transitions through the pathways, but also computer programming skills (Fig. 1). One ¹³C-MFA project on a non-model microbial species may take two graduate students one year to accomplish. As a matter of fact, fewer than 1000 ¹³C-MFA papers have been published in the past two decades, many of which are reviews or method papers [3]. In addition, most ¹³C-MFA studies focus on several model species (such as Bacillus subtilis and Escherichia coli). Although there are ~10⁹ microbial species on this planet [4], only a few ¹³C-MFA studies have been carried out on non-model microbial species. If microbiologists had more and better user-friendly and programming-free ¹³C-MFA tools, they could quickly understand diverse microbial metabolisms in a quantitative manner.

To reduce modeling challenges, mass spectrum (MS) data correction tools and ¹³C-MFA software have been developed, including FiatFlux [5], iMS2Flux [6], INCA [7], METRAN [8], OpenFLUX [9], OpenMebius [10], 13CFLUX [11] and 13CFLUX2 [12]. Using these tools and software, researchers can decipher metabolisms of bacterial, plant, and mammalian cells. Our laboratory has also been using ¹³C-MFA extensively to study both model and non-model bacterial species. Based on our experiences, we set out to build an open-source ¹³C-MFA platform (WUFlux) to facilitate analysis of metabolisms in diverse microbes. To reduce the work of constructing flux models, we provide several model templates with predefined metabolic network and carbon atom mappings. As a result, WUFlux can minimize the work done by users and facilitate straightforward flux analysis. Using this platform, we can also standardize and disseminate our MFA work by depositing curated models and flux results into the WUFlux database, which will further benefit the development of fluxomic databases for investigating diverse microbial species [13, 14].

WUFlux implementation

We chose MATLAB as the programming environment, because it is broadly used by engineers and scientists in both industry and academia. We began with designing a graphical user interface by using GUIDE in MATLAB, and subsequently we created functions directly linked to tables, buttons, pop-up menus, and figures on the user interface.

Constructing a ¹³C-MFA model in WUFlux starts with defining the metabolic reactions in the ‘Metabolic Reactions’ section. Instead of asking users to design the metabolic network and carbon transitions from scratch, we have included multiple templates, which are suitable for studying chemoheterotrophic bacteria (e.g., E. coli, Shewanella oneidensis, and Bacillus subtilis), photomixotrophic cyanobacteria (e.g., Synechocystis sp. PCC 6803), and vanillin-degrading bacteria (e.g., Sphingobium SYK-6) [15–17]. Users can select an appropriate template and then fine-tune the metabolic network, for example, by knocking out reactions, changing boundary conditions, and adding outflow fluxes.

In the ‘Experiment Data’ section, experimental information must be provided before flux calculations can be made (Fig. 2). The first entry is the ratio of nonlabeled biomass from inoculation to the entire labeled culture. If bacterial inoculation introduces a significant amount of nonlabeled biomass into ¹³C-cultures, this ratio (with a default value of 0) will be used to correct the labeling patterns of measured metabolites. Next, the labeling patterns (or the mass isotopomer distributions, MIDs) of both substrates and proteinogenic amino acids or free metabolites are required. WUFlux can correct raw MS data of TBDMS (N-tert-butyldimethylsilyl-N-methyltrifluoroacetamide)-derivatized proteinogenic amino acids by employing a previously developed algorithm [18]. In addition, WUFlux can handle the application of multiple tracers (e.g., both glucose and glycerol) or isotopologues (e.g., 50 % [1-¹³C] glucose and 50 % [U-¹³C₆] glucose) in labeling experiments. The final piece of experimental information is the measured fluxes of any chemicals produced in the cell culture. The measured fluxes will be used in the objective function.
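The inoculum correction described above can be illustrated with a short Python sketch. It assumes a simple linear dilution model (measured MID = ratio × unlabeled MID + (1 − ratio) × labeled MID) and a binomial natural-abundance MID for the unlabeled biomass; the function names and the example numbers are hypothetical, and the formula actually implemented in WUFlux may differ.

```python
from math import comb

NATURAL_13C = 0.0107  # approximate natural abundance of 13C (assumed value)


def natural_mid(n_carbons, p=NATURAL_13C):
    """Binomial MID (M+0 ... M+n) of an unlabeled fragment with n carbons."""
    return [comb(n_carbons, k) * p**k * (1 - p)**(n_carbons - k)
            for k in range(n_carbons + 1)]


def correct_for_inoculum(measured_mid, ratio):
    """Remove the contribution of unlabeled inoculum biomass.

    Assumes measured = ratio * unlabeled + (1 - ratio) * labeled.
    """
    if not 0 <= ratio < 1:
        raise ValueError("ratio must be in [0, 1)")
    n = len(measured_mid) - 1
    unlabeled = natural_mid(n)
    corrected = [(m - ratio * u) / (1 - ratio)
                 for m, u in zip(measured_mid, unlabeled)]
    # Clip small negative values caused by noise, then renormalize to sum 1.
    corrected = [max(c, 0.0) for c in corrected]
    total = sum(corrected)
    return [c / total for c in corrected]


# Hypothetical three-carbon fragment measured in a culture in which
# 5 % of the biomass came from the unlabeled inoculum.
measured = [0.40, 0.30, 0.20, 0.10]
print(correct_for_inoculum(measured, 0.05))
```

With the default ratio of 0 the data pass through unchanged, which matches the default behaviour described in the text.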

The ‘Settings’ section allows users to customize the optimization parameters (e.g., the number of initial guesses and the maximum number of iterations). Thereafter, the flux calculation is ready to start. To determine the fluxome, we used the elementary metabolite unit (EMU) algorithm [19] to simulate the MIDs of proteinogenic amino acids or free metabolites. This method greatly reduces the number of variables compared to the traditional isotopomer mapping matrices approach [11]. The built-in MATLAB function ‘fmincon’ is employed for non-linear optimization: using ‘interior-point’ as the default algorithm, fmincon minimizes the differences between experimentally and computationally determined data, weighted by the measured variances. Because fmincon is a local solver, users should run the optimization from different initial guesses of fluxes and keep the solution with the lowest SSR (sum of squared residuals), which is taken as the global optimum (Fig. 2).
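The variance-weighted objective and the multi-start strategy can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the one-parameter toy model, the route MIDs, and the small pattern search standing in for fmincon are all hypothetical, and none of it reproduces the EMU simulation used by WUFlux.

```python
import random


def weighted_ssr(simulated, measured, variances):
    """Variance-weighted sum of squared residuals."""
    return sum((s - m) ** 2 / v
               for s, m, v in zip(simulated, measured, variances))


def simulate_mid(f):
    """Toy model: a metabolite made from two routes with fixed labeling."""
    route_a = [0.9, 0.1, 0.0]   # hypothetical MID produced by route A
    route_b = [0.1, 0.3, 0.6]   # hypothetical MID produced by route B
    return [f * a + (1 - f) * b for a, b in zip(route_a, route_b)]


def local_search(objective, x0, lower=0.0, upper=1.0, step=0.25, tol=1e-8):
    """Minimal bounded pattern search; a stand-in for a real NLP solver."""
    x, fx = x0, objective(x0)
    while step > tol:
        improved = False
        for candidate in (x - step, x + step):
            candidate = min(max(candidate, lower), upper)
            fc = objective(candidate)
            if fc < fx:
                x, fx, improved = candidate, fc, True
        if not improved:
            step /= 2
    return x, fx


def multi_start(objective, n_starts=10, seed=0):
    """Run the local search from random initial guesses; keep the lowest SSR."""
    rng = random.Random(seed)
    results = [local_search(objective, rng.random()) for _ in range(n_starts)]
    return min(results, key=lambda r: r[1])


measured = simulate_mid(0.7)           # synthetic "measurement" with f = 0.7
variances = [0.01 ** 2] * 3            # assumed measurement variance
best_f, best_ssr = multi_start(
    lambda f: weighted_ssr(simulate_mid(f), measured, variances))
print(round(best_f, 4), best_ssr)
```

The toy objective is convex, so every start reaches the same optimum; in a real flux model the starts can end at different local minima, which is why the lowest SSR is retained.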

The Monte Carlo method is used in the model to determine the confidence intervals of central metabolic fluxes. Briefly, MID data are randomly perturbed with normally distributed noise (within the average range of measurement errors), and the flux profile is then recalculated multiple times; the number of recalculations is customizable in WUFlux. The 95 % confidence intervals, for example, are then determined from the upper and lower 2.5 % of the recalculated fluxes via the bootstrap method. Additionally, the χ² test is applied to determine the goodness of fit, which users can use as a reference to decide whether the fit is statistically acceptable.
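A minimal Python sketch of this Monte Carlo procedure is given below. It assumes a hypothetical one-parameter model with a closed-form weighted least-squares fit so that the refitting step stays short; the route MIDs, noise level, and number of runs are illustrative choices rather than WUFlux defaults.

```python
import random

ROUTE_A = [0.9, 0.1, 0.0]   # hypothetical MID from route A
ROUTE_B = [0.1, 0.3, 0.6]   # hypothetical MID from route B


def fit_fraction(measured, sd):
    """Closed-form weighted least-squares estimate of the route-A fraction."""
    num = sum((m - b) * (a - b) / sd ** 2
              for m, a, b in zip(measured, ROUTE_A, ROUTE_B))
    den = sum((a - b) ** 2 / sd ** 2 for a, b in zip(ROUTE_A, ROUTE_B))
    return num / den


def percentile(sorted_values, q):
    """Linear-interpolation percentile of an ascending list, q in [0, 100]."""
    pos = (len(sorted_values) - 1) * q / 100
    lo = int(pos)
    hi = min(lo + 1, len(sorted_values) - 1)
    return sorted_values[lo] + (sorted_values[hi] - sorted_values[lo]) * (pos - lo)


def monte_carlo_ci(measured, sd, n_runs=1000, seed=1):
    """Perturb the MID data with Gaussian noise, refit, and take percentiles."""
    rng = random.Random(seed)
    estimates = sorted(
        fit_fraction([m + rng.gauss(0.0, sd) for m in measured], sd)
        for _ in range(n_runs))
    return percentile(estimates, 2.5), percentile(estimates, 97.5)


measured = [0.66, 0.16, 0.18]          # synthetic data, true fraction 0.7
low, high = monte_carlo_ci(measured, sd=0.01)
print(f"best fit {fit_fraction(measured, 0.01):.3f}, "
      f"95 % CI [{low:.3f}, {high:.3f}]")
```

The interval bounds are simply the 2.5th and 97.5th percentiles of the refitted values, so the number of runs controls how smooth those bounds are.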

Finally, all the flux values and confidence intervals are presented in the ‘Results’ panel, which can be exported to an Excel file. To better present the results, we have included functions that provide direct ways of visualizing the computed fluxes with respect to the constructed metabolic network and visualizing the comparisons between simulated and experimental MID data (see Additional file 1).

Results and discussion

Figure 2 shows the general procedures for performing ¹³C-MFA with WUFlux: 1) Choose a suitable template, 2) Modify the metabolic network and constraints, 3) Import the experimental data, 4) Customize the optimization parameters, 5) Estimate the flux distribution and determine the confidence intervals, and 6) Visualize the fluxes. More detailed information is provided in Additional file 1.

As a case study, we applied our software to reproduce the MID data and flux profiles of both the control and engineered fatty-acid-producing E. coli strains, which were published in our previous paper [15]. As shown in Fig. 3a-b, WUFlux can convert raw MS data into corrected MID data, which are in excellent agreement with the MID correction results from a well-developed mass isotope correction software [18]. We further used WUFlux to characterize the fluxomes of the E. coli strains with the corrected MID data. The results were then compared with those generated from METRAN and INCA (Fig. 3c-d and Additional file 2: Table S1). In general, the estimated flux values as well as the major changes between the control and engineered strains agree well with published data and optimization results from other software. All the differences are within 2 % of the glucose uptake rate. The flux results may differ for several reasons (Fig. 1). First, different software may employ different optimization algorithms and solvers for flux calculations. For example, WUFlux relies on the MATLAB built-in function ‘fmincon’, while INCA employs MATLAB’s ‘lsqlin’ function. Second, the MID data used for flux calculation are not identical (e.g., WUFlux did not select the MID data of proline because this amino acid often shows high noise-to-signal ratios). Third, the detailed model settings (e.g., the objective functions, biomass equations, statistical analysis, and flux constraints) may not be exactly the same across these software packages. Additionally, we want to point out that flux calculations can differ between cases with and without consideration of the isotopic impurity of labeled substrates [20] and the natural abundance of nonlabeled carbons (Additional file 2: Table S1). To obtain a more accurate flux analysis, we recommend that users consider both effects.
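To illustrate why isotopic impurity and natural abundance matter, the following Python sketch computes the substrate MID for a tracer mixture of 50 % [1-¹³C] glucose and 50 % [U-¹³C₆] glucose. The 99 % purity, the 1.07 % natural ¹³C abundance, and all function names are assumptions made for the example; they are not values or routines taken from WUFlux.

```python
NATURAL_13C = 0.0107  # approximate natural abundance of 13C (assumed value)


def mid_from_positions(p13c):
    """MID of a molecule given each carbon's probability of being 13C."""
    mid = [1.0]
    for p in p13c:
        nxt = [0.0] * (len(mid) + 1)
        for k, frac in enumerate(mid):
            nxt[k] += frac * (1 - p)      # this carbon is 12C
            nxt[k + 1] += frac * p        # this carbon is 13C
        mid = nxt
    return mid


def tracer_mid(labeled_positions, n_carbons=6, purity=0.99,
               natural=NATURAL_13C):
    """MID of one tracer; positions are 1-based carbon indices."""
    probs = [purity if i in labeled_positions else natural
             for i in range(1, n_carbons + 1)]
    return mid_from_positions(probs)


def mix(tracers):
    """Weighted average of tracer MIDs; tracers = [(fraction, mid), ...]."""
    length = len(tracers[0][1])
    return [sum(frac * mid[k] for frac, mid in tracers) for k in range(length)]


all_six = set(range(1, 7))
# 50 % [1-13C] glucose and 50 % [U-13C6] glucose, both assumed 99 % pure.
substrate = mix([(0.5, tracer_mid({1})), (0.5, tracer_mid(all_six))])
# Idealized case: perfectly pure tracers and no natural 13C.
ideal = mix([(0.5, tracer_mid({1}, purity=1.0, natural=0.0)),
             (0.5, tracer_mid(all_six, purity=1.0, natural=0.0))])
for k, (real, perfect) in enumerate(zip(substrate, ideal)):
    print(f"M+{k}: {real:.4f} (ideal {perfect:.1f})")
```

In the idealized case only M+1 and M+6 are populated, whereas the more realistic input spreads a few percent of the substrate across neighbouring mass isotopomers, and that shift propagates into the simulated MIDs used for fitting.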

¹³C-MFA is an important tool to reveal a cell’s energy state for cell biosynthesis and well-being. In cellular processes, the energy molecule ATP is not only used for biosynthesis, but also consumed for diverse non-growth associated activities, such as cell repair and stress responses. ¹³C-MFA can calculate the total ATP generation from catabolism and ATP consumption for biosynthesis. The excess ATP can be assumed to be the maintenance cost, which is defined in this study as the overall ATP required for maintaining each gram of biomass (mmol/g DW). Here, we demonstrate how to apply WUFlux to study energy metabolism by using the isotopomer data from reference [15]. In Fig. 4a, we divided the carbon distributions into biomass synthesis, fatty acid production, CO₂ loss, and acetate production. The results show that the engineered strain successfully directs more carbon flow towards fatty acid production, while the control strain uses the majority of the carbons for biomass synthesis. Additionally, we can use flux data to analyze cellular energy expenditure. For example, ATP loss for maintenance energy in the engineered strain was estimated to be two-fold larger than that in the control strain (Fig. 4b-c), suggesting that overproduction of fatty acid led to a higher energy burden on the host strain. ¹³C-MFA can quantify cell energy fluxes and is particularly useful for understanding the ATP and cofactor balances in engineered microbial hosts. Lastly, users can add an ‘energy balance’ equation in WUFlux (e.g., the ATP net production is equal to consumption for biosynthesis). Under such an assumption, the P/O ratios may impact flux calculation results. Figure 4d-f illustrates the influence of P/O ratios on flux estimation of the engineered E. coli strain. The results show that flux estimation is insensitive to P/O ratios under ‘energy unbalanced’ conditions (when the flux towards ATP maintenance loss is unconstrained, Fig. 4d and e). However, the flux values of many pathways and the values of SSR can be significantly affected by the P/O ratio under ‘energy balanced’ conditions (when the ATP maintenance loss is assumed to be zero, Fig. 4d and f).
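The ATP bookkeeping described above can be written out as a small Python sketch. It assumes that ATP production is the sum of substrate-level phosphorylation and oxidative phosphorylation (P/O ratio times the oxidized NADH and FADH₂ fluxes) and that the surplus over biosynthetic demand is the maintenance loss; all flux values are hypothetical and only loosely scaled to a glucose uptake of 100, and the exact balance equation used in WUFlux may include other terms.

```python
def atp_balance(substrate_level_atp, nadh_oxidized, fadh2_oxidized,
                atp_for_biosynthesis, po_nadh=2.5, po_fadh2=1.5):
    """Return total ATP production and the surplus left after biosynthesis."""
    oxidative = po_nadh * nadh_oxidized + po_fadh2 * fadh2_oxidized
    produced = substrate_level_atp + oxidative
    surplus = produced - atp_for_biosynthesis
    return produced, surplus


# Hypothetical fluxes (relative to a glucose uptake rate of 100).
fluxes = dict(substrate_level_atp=180.0, nadh_oxidized=400.0,
              fadh2_oxidized=60.0, atp_for_biosynthesis=700.0)

# Scan assumed P/O ratios for NADH; FADH2 is taken as one unit lower.
for po in (1.5, 2.0, 2.5, 3.0):
    produced, surplus = atp_balance(po_nadh=po, po_fadh2=po - 1.0, **fluxes)
    print(f"P/O = {po:.1f}: ATP produced = {produced:.0f}, "
          f"maintenance (surplus) = {surplus:.0f}")
```

When the maintenance term is left free, a change in the P/O ratio only changes the computed surplus. When the surplus is forced to zero, the same change has to be absorbed by the fluxes themselves, which is consistent with the sensitivity reported under the ‘energy balanced’ condition.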

Conclusion

¹³C-MFA is a powerful tool for metabolism analysis, but the overall process of performing ¹³C-MFA is usually not fast enough for biologists to characterize novel microbial species or to provide timely insights into engineered strains in the design-build-test-learn cycle. To overcome this problem, we have designed an open-source MATLAB-based platform, WUFlux, which provides programming-free and straightforward ways of performing ¹³C-MFA. By testing WUFlux against the other software, we showed that WUFlux can correct raw MS data and reproduce the flux estimation of previously published flux analysis studies. Because the MATLAB codes of all function files in WUFlux are open to researchers, users can extend or enhance its capabilities. By using this platform, we can standardize and document the details of ¹³C-MFA studies. We will continue to update the software package by including more flux models of non-model microbial species.

Availability and requirements

Project name: WUFlux
Project homepage: www.13cmfa.org
Operating systems: Preferably Windows OS 7 or higher
Programming language: MATLAB
Other requirements: MATLAB 2012b or higher with the Optimization Toolbox, Symbolic Math Toolbox, and Statistics Toolbox.
License: WUFlux is freely available.
Any restrictions to use by non-academics: none

References

1. Lakshmanan M, Koh G, Chung BK, Lee D-Y. Software applications for flux balance analysis. Brief Bioinform. 2014;15(1):108–22.
2. Feng X, Xu Y, Chen Y, Tang Y. MicrobesFlux: a web platform for drafting metabolic models from the KEGG database. BMC Syst Biol. 2012;6(1):94.
3. Crown SB, Antoniewicz MR. Publishing ¹³C metabolic flux analysis studies: a review and future perspectives. Metab Eng. 2013;20:42–8.
4. Schloss PD, Handelsman J. Status of the microbial census. Microbiol Mol Biol Rev. 2004;68(4):686–91.
5. Zamboni N, Fischer E, Sauer U. FiatFlux - a software for metabolic flux analysis from ¹³C-glucose experiments. BMC Bioinformatics. 2005;6:209.
6. Poskar CH, Huege J, Krach C, Franke M, Shachar-Hill Y, Junker B. iMS2Flux - a high-throughput processing tool for stable isotope labeled mass spectrometric data used for metabolic flux analysis. BMC Bioinformatics. 2012;13(1):295.
7. Young JD. INCA: a computational platform for isotopically non-stationary metabolic flux analysis. Bioinformatics. 2014;30(9):1333–5.
8. Yoo H, Antoniewicz MR, Stephanopoulos G, Kelleher JK. Quantifying reductive carboxylation flux of glutamine to lipid in a brown adipocyte cell line. J Biol Chem. 2008;283(30):20621–7.
9. Quek L-E, Wittmann C, Nielsen LK, Krömer JO. OpenFLUX: efficient modelling software for ¹³C-based metabolic flux analysis. Microb Cell Fact. 2009;8:25.
10. Kajihata S, Furusawa C, Matsuda F, Shimizu H. OpenMebius: an open source software for isotopically nonstationary ¹³C-based metabolic flux analysis. Biomed Res Int. 2014;2014:10.
11. Wiechert W, Mollney M, Petersen S, de Graaf A. A universal framework for ¹³C metabolic flux analysis. Metab Eng. 2001;3:265–83.
12. Weitzel M, Noh K, Dalman T, Niedenfuhr S, Stute B, Wiechert W. 13CFLUX2 - high-performance software suite for ¹³C-metabolic flux analysis. Bioinformatics. 2013;29(1):143–5.
13. Gang S, Wang Y, Jiang W, Oyetunde T, Yao R, Zhang X, Shimizu K, Tang Y, Bao F. Rapid prediction of bacterial fluxomics using machine learning and constraint programming. PLoS Comput Biol. 2016;12(4):e1004838.
14. Zhang Z, Shen T, Rui B, Zhou W, Zhou X, Shang C, Xin C, Liu X, Li G, Jiang J, et al. CeCaFDB: a curated database for the documentation, visualization and comparative analysis of central carbon metabolic flux distributions explored by ¹³C-fluxomics. Nucleic Acids Res. 2015;43(D1):D549–57.
15. He L, Xiao Y, Gebreselassie N, Zhang F, Antoniewicz MR, Tang YJ, Peng L. Central metabolic responses to the overproduction of fatty acids in Escherichia coli based on ¹³C-metabolic flux analysis. Biotechnol Bioeng. 2014;111(3):575–85.
16. You L, Berla B, He L, Pakrasi HB, Tang YJ. ¹³C-MFA delineates the photomixotrophic metabolism of Synechocystis sp. PCC 6803 under light- and carbon-sufficient conditions. Biotechnol J. 2014;9(5):684–92.
17. Varman AM, He L, Follenfant R, Wu W, Wemmer S, Wrobel SA, Tang YJ, Singh S. Decoding how a soil bacterium extracts building blocks and metabolic energy from ligninolysis provides road map for lignin valorization. Proc Natl Acad Sci USA. 2016;113(40):E5802–E5811.
18. Wahl SA, Dauner M, Wiechert W. New tools for mass isotopomer data evaluation in ¹³C flux analysis: mass isotope correction, data consistency checking, and precursor relationships. Biotechnol Bioeng. 2004;85(3):259–68.
19. Antoniewicz MR, Kelleher JK, Stephanopoulos G. Elementary metabolite units (EMU): a novel framework for modeling isotopic distributions. Metab Eng. 2007;9(1):68–86.
20. Feng X, Tang YJ. Evaluation of isotope discrimination in ¹³C-based metabolic flux analysis. Anal Biochem. 2011;417(2):295–7.
21. Hollinshead WD, Henson WR, Abernathy M, Moon TS, Tang YJ. Rapid metabolic analysis of Rhodococcus opacus PD630 via parallel ¹³C-metabolite fingerprinting. Biotechnol Bioeng. 2016;113(1):91–100.

Acknowledgements

We thank Prof. James Ballard for editorial advice on our manuscript.

Funding

The project was funded by NSF (DBI 1356669 and MCB 1616619).

Authors’ contributions

YJT and LH initiated the project. LH, YJT and SGW built the original user interface and programmed WUFlux. MZ and YC improved the computational algorithm, user interface, and visualization of flux distributions. LH and SGW prepared the first draft of manuscript and user manual. LH, SGW, MZ, YC, and YJT read, edited, and approved the manuscript.

Competing interests

The authors declare that they have no competing interests.

Consent for publication

Not applicable.

Ethics approval and consent to participate

Not applicable.

Author information

Authors and Affiliations

Department of Energy, Environmental and Chemical Engineering, Washington University, St. Louis, MO, 63130, USA
Lian He, Stephen G. Wu & Yinjie J. Tang
Department of Computer Science and Engineering, Washington University, St. Louis, MO, 63130, USA
Muhan Zhang & Yixin Chen

Corresponding authors

Correspondence to Lian He or Yinjie J. Tang.

Additional files

Additional file 1:

User manual for WUFlux (available at www.13cmfa.org). (PDF 1218 kb)

Additional file 2:

Comparison of flux estimations from WUFlux, METRAN, and INCA. (DOCX 18 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

About this article

Cite this article

He, L., Wu, S.G., Zhang, M. et al. WUFlux: an open-source platform for ¹³C metabolic flux analysis of bacterial metabolism. BMC Bioinformatics 17, 444 (2016). https://doi.org/10.1186/s12859-016-1314-0

Received: 10 January 2016
Accepted: 26 October 2016