Software | Open | Published:
WUFlux: an open-source platform for 13C metabolic flux analysis of bacterial metabolism
BMC Bioinformaticsvolume 17, Article number: 444 (2016)
Flux analyses, including flux balance analysis (FBA) and 13C-metabolic flux analysis (13C-MFA), offer direct insights into cell metabolism, and have been widely used to characterize model and non-model microbial species. Nonetheless, constructing the 13C-MFA model and performing flux calculation are demanding for new learners, because they require knowledge of metabolic networks, carbon transitions, and computer programming. To facilitate and standardize the 13C-MFA modeling work, we set out to publish a user-friendly and programming-free platform (WUFlux) for flux calculations in MATLAB®.
We constructed an open-source platform for steady-state 13C-MFA. Using GUIDE (graphical user interface design environment) in MATLAB, we built a user interface that allows users to modify models based on their own experimental conditions. WUFlux is capable of directly correcting mass spectrum data of TBDMS (N-tert-butyldimethylsilyl-N-methyltrifluoroacetamide)-derivatized proteinogenic amino acids by removing background noise. To simplify 13C-MFA of different prokaryotic species, the software provides several metabolic network templates, including those for chemoheterotrophic bacteria and mixotrophic cyanobacteria. Users can modify the network and constraints, and then analyze the microbial carbon and energy metabolisms of various carbon substrates (e.g., glucose, pyruvate/lactate, acetate, xylose, and glycerol). WUFlux also offers several ways of visualizing the flux results with respect to the constructed network. To validate our model’s applicability, we have compared and discussed the flux results obtained from WUFlux and other MFA software. We have also illustrated how model constraints of cofactor and ATP balances influence fluxome results.
Open-source software for 13C-MFA, WUFlux, with a user-friendly interface and easy-to-modify templates, is now available at http://www.13cmfa.org/or (http://tang.eece.wustl.edu/ToolDevelopment.htm). We will continue documenting curated models of non-model microbial species and improving WUFlux performance.
Metabolic flux analyses, including flux balance analysis (FBA) and 13C metabolic flux analysis (MFA), are widely used to predict or measure in vivo enzyme reaction rates in microbes. FBA can unravel microbial metabolism based on the stoichiometry of the metabolic reactions as well as measurements of the inflow (substrate uptake) and outflow fluxes (biomass and product synthesis). To facilitate the development of genome scale models, much software has been developed . Our research group built a web-based platform named MicrobesFlux (http://www.microbesflux.org/) . This platform can automatically draft a metabolic model from the annotated microbial genome in the KEGG database. Based on users’ feedback, we have re-built our system on a commercial server to improve its functionality, stability, and robustness. The new MicrobesFlux has been updated with both AMPL optimization software and metabolic network information from the latest version of the KEGG database. This platform now includes 3192 species compared to 1304 species in the previous version. Nevertheless, the MicrobesFlux platform still performs only FBA to estimate the flux values. A more rigorous flux analysis requires 13C-MFA, which combines FBA with 13C isotopic tracing. To complement the current platform, we sought to build an open-source MATLAB-based package (WUFlux) for metabolic flux analysis.
13C-MFA requires both experimental and modeling efforts (Fig. 1). 13C-labeling experiments consist of feeding the cell culture with defined 13C-substrates to fingerprint downstream metabolites with 13C-carbons. Once 13C has reached a steady state distribution throughout the metabolic network, the labeling patterns of proteinogenic amino acids or free metabolites can be used by a 13C-MFA model to decipher the intracellular flux distributions. 13C-MFA can help researchers discover novel pathways, resolve reversible and branched fluxes, and quantify circular metabolic routes (e.g., the tricarboxylic acid cycle or TCA cycle). However, 13C-MFA is challenging. In terms of experiments, conventional 13C-MFA requires that the cell cultures grow in a defined medium and under steady state conditions. The researchers need to select proper 13C tracers and obtain high-quality isotopomer data for flux analysis. Meanwhile, construction of the 13C-MFA model and flux calculation are demanding for new learners, because they require not only knowledge of metabolic networks and carbon transitions through the pathways, but also computer programming skills (Fig. 1). One 13C-MFA project on a non-model microbial species may take two graduate students one year to accomplish. As a matter of fact, fewer than 1000 13C-MFA papers have been published in the past two decades, many of which are reviews or method papers . In addition, most 13C-MFA studies focus on several model species (such as Bacillus subtilis and Escherichia coli). Although there are ~109 microbial species on this planet , only a few 13C-MFA studies have been carried out on non-model microbial species. If microbiologists had more and better user-friendly and programming-free 13C-MFA tools, they could quickly understand diverse microbial metabolisms in a quantitative manner.
To reduce modeling challenges, mass spectrum (MS) data correction tools and 13C-MFA software have been developed, including FiatFlux , iMS2Flux , INCA , METRAN , OpenFLUX , OpenMebius , 13CFLUX  and 13CFLUX2 . Using these tools and software, researchers can decipher metabolisms of bacterial, plant, and mammalian cells. Our laboratory has also been using 13C-MFA extensively to study both model and non-model bacterial species. Based on our experiences, we set out to build an open-source 13C-MFA platform (WUFlux) to facilitate analysis of metabolisms in diverse microbes. To reduce the work of constructing flux models, we provide several model templates with predefined metabolic network and carbon atom mappings. As a result, WUFlux can minimize the work done by users and facilitate straightforward flux analysis. Using this platform, we can also standardize and disseminate our MFA work by depositing curated models and flux results into the WUFlux database, which will further benefit the development of fluxomic databases for investigating diverse microbial species [13, 14].
We chose MATLAB as the programming environment, because it is broadly used by engineers and scientists in both industry and academia. We began with designing a graphical user interface by using GUIDE in MATLAB, and subsequently we created functions directly linked to tables, buttons, pop-up menus, and figures on the user interface.
Constructing a 13C MFA model in WUFlux starts with defining the metabolic reactions in the ‘Metabolic Reactions’ section. Instead of asking users to design the metabolic network and carbon transitions from scratch, we have included multiple templates which are suitable for studying chemoheterotrophic (e.g., E. coli, Shewanella oneidensis, and Bacillus subtilis), photomixotrophic cyanobacteria (e.g., Synechocystissp. PCC6803), and vanillin-degrading bacteria (e.g., Sphingobium SYK-6) [15–17]. Users can select an appropriate template, and easily make modifications to fine-tune the metabolic network, for example, by knocking out reactions, changing boundary conditions, and adding outflow fluxes.
In the ‘Experiment Data’ section, experimental information must be provided before flux calculations can be made (Fig. 2). The first entry is the ratio of nonlabeled biomass from inoculation to the entire labeled culture. If bacterial inoculation introduces a significant amount of non-labeled biomass in 13C-cultures, this ratio (with a default value of 0) will be used to correct the labeling patterns of measured metabolites. Next, the labeling patterns (or the mass isotopomer distribution, MID) of both substrates and proteinogenic amino acids or free metabolites are required. WUFlux can correct raw MSfr(N-tert-butyldimethylsilyl-N-methyltrifluoroacetamide)-derivatized proteinogenic amino acids by employing a previously developed algorithm , which promises accurate data correction. In addition, WUFlux can handle the application of multiple tracers (e.g., both glucose and glycerol) or isotopologues (e.g., 50 % [1-13C] glucose and 50 % [U-13C6] glucose) in labeling experiments. The final experimental information is the measured fluxes of any chemicals produced in the cell culture. The measured fluxes will be used in the objective function.
The ‘Settings’ section allows users to customize the optimization parameters (e.g., the number of initial guesses and maximum iteration number). Thereafter, the flux calculation is ready to start. To determine the fluxome, we used the element metabolite unit algorithm  to simulate the MIDs of proteinogenic amino acids or free metabolites. This method largely reduces the number of variables compared to the traditional isotopomer mapping matrices approach . The built-in MATLAB function ‘fmincon’ is employed for non-linear optimization, i.e., using ‘interior-point’ as the default algorithm, fmincon minimizes the differences between experimentally and computationally determined data weighted by measured variances. To avoid local solutions, users need to run different initial guesses of fluxes, so that fmincon can find the global optimal solution with the least SSR (sum of squared residuals) (Fig. 2).
The Monte Carlo method is used in the model to determine the confidence intervals of central metabolic fluxes. Briefly speaking, MID data are randomly perturbed with normally distributed noises (within the average range of measurement errors), and the flux profile is then recalculated multiple times, which is customizable in WUFlux. The 95 % confidence intervals, for example, are consequently determined by the upper and lower 2.5 % data via the bootstrap method. Additionally, the χ 2 test is applied to determine the goodness of fit, which users can use as the reference to determine whether the fitting is statistically acceptable.
Finally, all the flux values and confidence intervals are presented in the ‘Results’ panel, which can be exported to an Excel file. To better present the results, we have included functions that provide direct ways of visualizing the computed fluxes with respect to the constructed metabolic network and visualizing the comparisons between simulated and experimental MID data (see Additional file 1).
Results and discussion
Figure 2 shows the general procedures for performing 13C-MFA with WUFlux: 1) Choose a suitable template, 2) Modify the metabolic network and constraints, 3) Import the experimental data, 4) Customize the optimization parameters, 5) Estimate the flux distribution and determine the confidence intervals, and 6) Visualize the fluxes. More detailed information is provided in Additional file 1.
As a case study, we applied our software to reproduce the MID data and flux profile of both the control and engineered fatty-acid producing E. coli strains, which were published in our previous paper . As shown in Fig. 3a-b, WUFlux can convert raw MS data into effective MID data, which is in excellent accordance with MID correction results by a well-developed mass isotope correction software . We further used WUFlux to characterize the fluxomes of E. coli strains with corrected MID data. The results were then compared with those generated from METRAN and INCA (Fig. 3c-d and Additional file 2: Table S1). In general, the estimated flux values as well as the major changes between the control and engineered strains agree well with published data and optimization results from other software. All the differences are within 2 % of the glucose uptake rate. The flux results may differ for several reasons (Fig. 1). First, different software may employ different optimization algorithms and solvers for flux calculations. For example, WUFlux relies on the MATLAB built-in function ‘fmincon’, while INCA employs MATLAB’s ‘lsqlin’ function. Second, MID data used for flux calculation are not identical (e.g., WUFlux did not select the MID data of proline because this amino acid often shows high noise-to-signal ratios). Third, the detailed model settings (e.g., the objective functions, biomass equations, statistical analysis, and flux constraints) may not be exactly the same among those software. Additionally, we want to point out the flux calculations can differ between cases with and without consideration of isotopic impurity of labeled substrates  and natural abundance of nonlabeled carbons (Additional file 2: Table S1). To gain a more accurate flux analysis, we recommend users to consider both effects.
13C-MFA is an important tool to reveal a cell’s energy state for cell biosynthesis and well-being. In cellular processes, the energy molecule ATP is not only used for biosynthesis, but also consumed for diverse non-growth associated activities, such as cell repair and stress responses. 13C-MFA can calculate the total ATP generation from catabolism and ATP consumption for biosynthesis. The excess ATP can be assumed to be the maintenance cost, which is defined as the overall ATP required for maintaining each gram of biomass (mmol/g DW) in this study. Here, we demonstrate how to apply WUFlux to study energy metabolism by using the isotopomer data from reference . In Fig. 4a, we divided the carbon distributions into biomass synthesis, fatty acid production, CO2 loss, and acetate production. The results prove that the engineered strain can successfully direct more carbon flow towards fatty acid production, while the control strain uses the majority of the carbons for biomass synthesis. Additionally, we can use flux data to analyze cellular energy expenditure. For example, ATP loss for maintenance energy in the engineered strain was estimated to be two-fold larger than that in the control strain (Fig. 4b-c), suggesting that overproduction of fatty acid led to a higher energy burden on the host strain. 13C-MFA can quantify cell energy fluxes and is particularly useful for understanding the ATP and cofactor balances in engineered microbial hosts. Lastly, users can add an ‘energy balance’ equation in WUFlux (e.g., the ATP net production is equal to consumption for biosynthesis). Under such an assumption, the P/O ratios may impact flux calculation results. Figure 4d-f illustrates the influence of P/O ratios on flux estimation of the engineered E. coli strain. The results show that flux estimation is insensitive to P/O ratios under ‘energy unbalanced’ conditions (when the flux towards ATP maintenance loss is unconstrained, Fig. 4d and e). However, the flux values of many pathways and the values of SSR can be significantly affected by the P/O ratio under ‘energy balanced’ conditions (when the ATP maintenance loss is assumed to be zero, Fig. 4d and f).
13C-MFA is a powerful tool for metabolism analysis, but the overall process of performing 13C-MFA is usually not fast enough for biologists to characterize novel microbial species or to provide timely insights into engineered strains in the design-build-test-learn cycle. To overcome this problem, we have designed an open-source MATLAB-based platform, WUFlux, which provides programming-free and straightforward ways of performing 13C-MFA. By testing WUFlux against the other software, we showed that WUFlux can correct raw MS data and reproduce the flux estimation of previously published flux analysis studies. Because the MATLAB codes of all function files in WUFlux are open to researchers, users can extend or enhance its capabilities. By using this platform, we can standardize and document the details of 13C-MFA studies. We will continue to update the software package by including more flux models of non-model microbial species.
Availability and requirements
Project name: WUFlux
Project homepage: www.13cmfa.org
Operating systems: Preferably Windows OS 7 or higher
Programming language: MATLAB
Other requirements: MATLAB 2012b or higher with optimization toolbox, symbolic math toolbox, and statistic toolbox.
License: WUFlux is freely available.
Any restrictions to use by non-academics: none
Lakshmanan M, Koh G, Chung BK, Lee D-Y. Software applications for flux balance analysis. Brief Bioinform. 2014;15(1):108–22.
Feng X, Xu Y, Chen Y, Tang Y. MicrobesFlux: a web platform for drafting metabolic models from the KEGG database. BMC Syst Biol. 2012;6(1):94.
Crown SB, Antoniewicz MR. Publishing 13C metabolic flux analysis studies: a review and future perspectives. Metab Eng. 2013;20:42–8.
Schloss PD, Handelsman J. Status of the microbial census. Microbiol Mol Biol Rev. 2004;68(4):686–91.
Zamboni N, Fischer E, Sauer U. FiatFlux-a software for metabolic flux analysis from 13C-glucose experiments. BMC Bioinformatics. 2005;6:209.
Poskar CH, Huege J, Krach C, Franke M, Shachar-Hill Y, Junker B. iMS2Flux - a high-throughput processing tool for stable isotope labeled mass spectrometric data used for metabolic flux analysis. BMC Bioinformatics. 2012;13(1):295.
Young JD. INCA: a computational platform for isotopically non-stationary metabolic flux analysis. Bioinformatics. 2014;30(9):1333–5.
Yoo H, Antoniewicz MR, Stephanopoulos G, Kelleher JK. Quantifying reductive carboxylation flux of glutamine to lipid in a brown adipocyte cell line. J Biol Chem. 2008;283(30):20621–7.
Quek L-E, Wittmann C, Nielsen LK, Krömer JO. OpenFLUX: efficient modelling software for 13C-based metabolic flux analysis. Microb Cell Fact. 2009;8:25.
Kajihata S, Furusawa C, Matsuda F, Shimizu H. OpenMebius: an open source software for isotopically nonstationary 13C-based metabolic flux analysis. Biomed Res Int. 2014;2014:10.
Wiechert W, Mollney M, Petersen S, de Graaf A. A universal framework for 13C metabolic flux analysis. Metab Eng. 2001;3:265–83.
Weitzel M, Noh K, Dalman T, Niedenfuhr S, Stute B, Wiechert W. 13CFLUX2 - high-performance software suite for 13C-metabolic flux analysis. Bioinformatics. 2013;29(1):143–5.
Gang S, Wang Y, Jiang W, Oyetunde T, Yao R, Zhang X, Shimizu K, Tang Y, Bao F. Rapid prediction of bacterial fluxomics using machine learning and constraint programming. PLoS Comput Biol. 2016;12(4):e1004838.
Zhang Z, Shen T, Rui B, Zhou W, Zhou X, Shang C, Xin C, Liu X, Li G, Jiang J, et al. CeCaFDB: a curated database for the documentation, visualization and comparative analysis of central carbon metabolic flux distributions explored by 13C-fluxomics. Nucleic Acids Res. 2015;43(D1):D549–57.
He L, Xiao Y, Gebreselassie N, Zhang F, Antoniewicz MR, Tang YJ, Peng L. Central metabolic responses to the overproduction of fatty acids in Escherichia coli based on 13C-metabolic flux analysis. Biotechnol Bioeng. 2014;111(3):575–85.
You L, Berla B, He L, Pakrasi HB, Tang YJ. 13C-MFA delineates the photomixotrophic metabolism of Synechocystis sp. PCC 6803 under light- and carbon-sufficient conditions. Biotechnol J. 2014;9(5):684–92.
Varman AM, He L, Follenfant R, Wu W, Wemmer S, Wrobel SA, Tang YJ, Singh S. Decoding how a soil bacterium extracts building blocks and metabolic energy from ligninolysis provides road map for lignin valorization. Proc Natl Acad Sci USA. 2016;113(40):E5802-E5811.
Wahl SA, Dauner M, Wiechert W. New tools for mass isotopomer data evaluation in 13C flux analysis: mass isotope correction, data consistency checking, and precursor relationships. Biotechnol Bioeng. 2004;85(3):259–68.
Antoniewicz MR, Kelleher JK, Stephanopoulos G. Elementary metabolite units (EMU): a novel framework for modeling isotopic distributions. Metab Eng. 2007;9(1):68–86.
Feng X, Tang YJ. Evaluation of isotope discrimination in 13C-based metabolic flux analysis. Anal Biochem. 2011;417(2):295–7.
Hollinshead WD, Henson WR, Abernathy M, Moon TS, Tang YJ. Rapid metabolic analysis of Rhodococcus opacus PD630 via parallel 13C‐metabolite fingerprinting. Biotechnol Bioeng. 2016;113(1):91–100.
We thank Prof. James Ballard for editorial advice on our manuscript.
The project was funded by NSF (DBI 1356669 and MCB 1616619).
YJT and LH initiated the project. LH, YJT and SGW built the original user interface and programmed WUFlux. MZ and YC improved the computational algorithm, user interface, and visualization of flux distributions. LH and SGW prepared the first draft of manuscript and user manual. LH, SGW, MZ, YC, and YJT read, edited, and approved the manuscript.
The authors declare that they have no competing interests.
Consent for publication
Ethics approval and consent to participate