Skip to main content

CiliateGEM: an open-project and a tool for predictions of ciliate metabolic variations and experimental condition design

Abstract

Background

The study of cell metabolism is becoming central in several fields such as biotechnology, evolution/adaptation and human disease investigations. Here we present CiliateGEM, the first metabolic network reconstruction draft of the freshwater ciliate Tetrahymena thermophila. We also provide the tools and resources to simulate different growth conditions and to predict metabolic variations. CiliateGEM can be extended to other ciliates in order to set up a meta-model, i.e. a metabolic network reconstruction valid for all ciliates.

Ciliates are complex unicellular eukaryotes of presumably monophyletic origin, with a phylogenetic position that is equal from plants and animals. These cells represent a new concept of unicellular system with a high degree of species, population biodiversity and cell complexity. Ciliates perform in a single cell all the functions of a pluricellular organism, including locomotion, feeding, digestion, and sexual processes.

Results

After generating the model, we performed an in-silico simulation with the presence and absence of glucose. The lack of this nutrient caused a 32.1% reduction rate in biomass synthesis. Despite the glucose starvation, the growth did not stop due to the use of alternative carbon sources such as amino acids.

Conclusions

The future models obtained from CiliateGEM may represent a new approach to describe the metabolism of ciliates. This tool will be a useful resource for the ciliate research community in order to extend these species as model organisms in different research fields. An improved understanding of ciliate metabolism could be relevant to elucidate the basis of biological phenomena like genotype-phenotype relationships, population genetics, and cilia-related disease mechanisms.

Background

A metabolic network represents in an organism the complete set of biochemical reactions suitable to synthesise or break-down metabolites. These reactions drive the production of biomass and energy to support all cellular processes. The reconstruction of a full metabolic network occurring within each cell has advanced from early biochemical studies to algorithmically-generated pathway diagrams starting from genomic sequencing [1]. Genome-scale models (GEMs) offer a comprehensive exploration and a rapid analysis of genomic data. GEMs have been used extensively to study metabolic engineering [2, 3], model-driven drug discoveries [4, 5], prediction of cellular phenotypes after perturbations [6, 7], analysis of evolutionary processes [8,9,10,11] and models of interspecies interactions [12]. Organism-specific reconstructed metabolic networks may be further implemented to build mathematical models capable of simulating metabolic fluxes [13]. GEM pathway reconstruction has been used in Caenorhabditis elegans to predict genes essentiality [14] and to better understand the biology of arthropods [15], including those with a negative impact (vectors of human or animal diseases, agricultural pests). The latter approach is particularly useful to control harmful species and to develop new precautionary strategies [15]. Genome-scale metabolic modelling has also been successfully applied to study metabolic networks in microbes [16], including a Polychlorinated Biphenyl-degrading Pseudomonas [17, 18], thermophilic bacteria [19] and members of the human gut microbiota [20].

Among eukaryotic microbes, at least 25 models of Saccharomyces cerevisiae have been published since 2003 [21], helping to understand yeast metabolism. Ciliated protozoans may represent an alternative and useful eukaryotic model. Ciliates have been the main subject of projects supported by the EU Framework Programme Horizon 2020 such as the COST Actions. Moreover, the National Centre for Genome Resources and the Gordon and Betty Moore Foundationā€™s supported the Marine Microbiology Initiative (MMI). MMI created a valuable benchmark against the analysis of environmental transcriptomic data [22]. Ciliates, as yeasts, are eukaryotic single cells, but their structural and functional complexity is comparable to human and other metazoan cells. These unicellular organisms are similar to differentiated animal cells with complex functions and membrane-bound structures [23]. The potential number of extant ciliate species has been estimated at nearly 30,000 [24]. They represent an important mediator in the food chain by transforming ultrafine organic matter useful for zooplankton. Being individual cells, they are directly exposed to environmental changes, making them good models for studying cell-stress response and adaptation. Ciliates propagate mainly asexually by transverse fission, even though they perform conjugation, a sexual process that ā€œrenewsā€ the genetic material. The complexity of ciliates is further represented by the presence of two different nuclei: the diploid micronucleus involved in conjugation and the polyploid macronucleus. While the first represents the germinal line, the second represents the somatic line. The macronucleus is responsible for gene expression during the vegetative phase.

We describe below a preliminary open software tool (CiliateGEM), focused on the T. thermophila macronuclear genomic sequences, which allows ciliates researchers to analyse a reconstructed network via Flux Balance Analysis (FBA). By studying this ciliate, we could discover new mechanisms for evolution and adaptation within metabolism, population, species and host-symbiont association. Cellular responses described in T. thermophila can be of fundamental importance to understand the biology of all ciliates. CiliateGEM is to date the most advanced tool available for ciliates and contains the highest number of curated biochemical reactions.

Results

CiliateGEM: An open project and a methodological pipeline

The complexity of ciliates makes it necessary to adopt a new approach to study their metabolism. To understand these organisms is fundamental to consider that they can feed, move, and reproduce in a single cell. Ciliate complexity includes also cell compartmentalisation, which ensures the optimal environment for each specific metabolic reaction (i.e. optimal lysosomes pH for macromolecule hydrolysis).

To date, CiliateGEM allows the analysis of their reconstructed networks using the ciliate Tetrahymena thermophila as a case study. We choose this organism because Tetrahymena micro [25] and macro-nuclear [26] genomes have been sequenced and its biology has been extensively studied. The goal of this open project is to create a tool to simulate the growth of Tetrahymena cells in different conditions and then to expand it to all ciliates. The development and the refinement of this tool, coupled with the COBRA toolbox [27], can lead researchers to predict gene essentiality and genotype-phenotype relationship. CiliateGEM has been obtained using a bottom-up approach, from genome annotation to a mathematical model. The steps we followed are represented in Fig. 1.

Fig. 1
figure 1

CiliateGEM pipeline. CiliateGEM was constructed using the protocol by Thiele and Palsson [13], as well as manual curation. To characterise the metabolic networks, we have gathered all core pathways from different organisms, including bacteria. CiliateGEM is provided in SBML and Matlab format as Additional files 6 and 7

Tetrahymena growth simulation

One of the first experimental in vitro studies of metabolic pathways in ciliates were performed using Tetrahymena pyriformis as a model organism [28,29,30]. Tetrahymena cells were grown in a standard mixture of l4C radiolabelled substrates including glucose, glycerol, pyruvate and glutamate. The incorporation of l4C in CO2, glycogen and nucleic acids was measured with ranges of values depending on the carbon labelled position in the substrate structure. Borowitz et al. [29] reported a higher incorporation of l4C into CO2 after 1 h of incubation with [l4C]glucose than in the same experiment with [l4C]glycerol, i.e. 43.9ā€“64.2 nmol/106 cells versus 19.8ā€“19.9 nmol/106 cells. Stein et al. [30] further reported a higher radiolabelled C incorporation in CO2 from [l4C]glucose than from pyruvate, glutamate and glycerol. The measured values of the labelled carbon incorporation into the CO2 product were as follows: [l4C]glucose (106ā€“255 nmol/106 cells); [l4C]pyruvate (55.4ā€“121 nmol/106 cells); [l4C]glutamate (1.90ā€“3.96 nmol/106 cells); [l4C]glycerol (11.0ā€“13.6 nmol/106 cells). Then, they tested glycogen as product, and the [l4C]glucose resulted the preferred substrate with the highest incorporation rates (313ā€“496 nmol/106 cells). A comparable incorporation rate between glycerol and pyruvate was reported (~ā€‰6.5 nmol/106 cells), as well as an absent incorporation from glutamate (<ā€‰0.2 nmol/106 cells). The results differed from those of CO2 among the secondary substrates (glutamate, pyruvate and glycerol), but agreed on the highest used substrate (glucose).

Our initial simulation showed the maximum growth when glucose, glutamate, glycerol and pyruvate were used as growth media. In agreement with literature [29, 30], simulations of CiliateGEM showed that glucose is utilised as the preferred carbon source (Fig. 2). The uptake for glucose in our model (āˆ’ā€‰10 mmol/gDW/hr) is proportionally much higher than those of the secondary substrates (glutamate, pyruvate and glycerol) described by the authors in the previous in-vitro experiments [29, 30]. We allowed such uptake to saturate the system so the full amount is not necessarily utilised because of the thermodynamics constraints. Glucose is metabolised through glycolysis in the cytosol. A simulation in the presence of the above metabolites but without glucose showed a 32.1% reduction rate on biomass synthesis (Fig. 3). Despite the reduced biomass production, the growth without glucose is largely maintained by utilising alternative carbon sources such as amino acids (Fig. 3), as shown by the higher increase of biomass production for reactions R00891 (i.e. L-Serine + Hydrogen sulfide <=ā€‰>ā€‰L-Cysteine + H2O) and R00258 (L-Alanine +ā€‰2-Oxoglutarate <=ā€‰>ā€‰Pyruvate + L-Glutamate).

Fig. 2
figure 2

Rate of biomass synthesis by CiliateGEM from different substrates. The CiliateGEM model was allowed to utilise different carbon sources for growth. Glucose, glutamate and pyruvate consumption (illustrated by negative flux) directly affects the growth rate of CiliateGEM (depicted by positive flux values)

Fig. 3
figure 3

Differential biomass production (%) after glucose starvation. Values for growth with (Gg) and without glucose (Gwg) were used in this formula (Gwg-Gg)/|Gg|*100

After glucose starvation, reactions R10822 [ADPā€‰+ā€‰DNA(n)ā€‰+ā€‰5ā€™-Phospho-DNA(m)ā€‰<ā€‰=> AMPā€‰+ā€‰Orthophosphate + DNA(nā€‰+ā€‰m)] and R00381 [ATPā€‰+ā€‰DNA(n)ā€‰+ā€‰5ā€™-Phospho-DNA(m)ā€‰<ā€‰=> AMPā€‰+ā€‰Diphosphate + DNA(nā€‰+ā€‰m)] showed the highest positive and negative values. Both reactions are catalysed by DNA ligases during DNA repair or recombination. These two reactions differ only on ATP or ADP cofactors. ATP is the phosphorylated form of ADP and is the preferred cofactor because phosphoanhydride bonds store a high amount of energy. Since ATP is synthesised from ADP during glycolysis by phosphorylation, we can speculate that in absence of glucose DNA ligation by ligase is performed by using ADP molecules as an alternative. DNA ligase using ADP has been reported from the aerobic hyper-thermophilic archaeon Aeropyrum pernix K1 [31].

Other opposite values were reported for reactions R02549 (4-Aminobutyraldehyde + NAD+ + H2Oā€‰<ā€‰=>ā€‰4-Aminobutanoate + NADH + H+) and R01986 (4-Aminobutyraldehyde + NADP+ + H2Oā€‰<ā€‰=>ā€‰4-Aminobutanoate + NADPH + H+), both involving in the arginine-catabolism pathway. Like R10822 and R00381 they only differ on cofactors, in this case NAD and NADP. While the structural difference between these two molecules is only the phosphate group, NADH participates in catabolic reactions, i.e. reactions that break down molecules to release energy, while NADPH participates in anabolic reactions, namely those that consume energy in order to build up or synthesise larger molecules.

Finally, reactions R04241 (ATPā€‰+ā€‰THF-polyglutamate(n)ā€‰+ā€‰L-Glutamate <=ā€‰>ā€‰ADPā€‰+ā€‰Orthophosphate + THF-polyglutamate(nā€‰+ā€‰1)) and R04242 (THF-polyglutamate + n H2Oā€‰<ā€‰=> Tetrahydrofolate + n L-Glutamate) showed an increase of biomass production in the simulation without glucose. Both reactions are involved in folate biosynthesis, an essential cofactor for DNA and amino acid synthesis.

Discussion

Studies on Tetrahymena have led ciliate researchers to landmark discoveries on cellular mechanisms, including the first cytoskeletal motor [32] and programmed translational frameshifting [33]. Some other studies have been awarded two Nobel prizes: to T. Cech and S. Altman for the discovery of catalytic RNA in 1989; and to E. H. Blackburn, C. W. Greider and J. W. Szostak for their work on telomeres in 2009. In general, mechanisms first described in ciliates have proven to be of widespread occurrence and of fundamental importance for the biology of all eukaryotes. Furthermore, ciliates are the only unicellular organisms in which programmed DNA elimination during somatic differentiation, widespread in animals [34], has been deeply characterised [35].

In this study, we set the basis of the open project CiliateGEM and we constructed a metabolic model draft of T. thermophila, made freely available to researchers. We checked the model robustness by simulating the organism cell growth in the presence or absence of glucose (the preferred carbon source utilised by this organism). Then we compared glucose to glutamate, pyruvate and glycerol consumption. Despite the organisms shows a decreased rate of biomass production, the growth without glucose is maintained by using alternative carbon sources, as amino acids and the tetrahydrofolate synthesis (essential cofactor in several metabolic reactions). This metabolic variation appears to increase the activity of ADP dependent DNA ligase I, possibly involved in DNA repair or recombination (R10822). We also speculated that in absence of glucose, ADP may be the preferred energy molecule instead of ATP, and the metabolism is switched towards catabolic reactions in order to obtain energy for growth. Our results showed consistency with previous studies on Tetrahymena pyriformis [28,29,30], therefore suggesting that CiliateGEM could provide a framework for the entire community.

CiliateGEM is freely available for academics and can be fully customised and updated. Our long-term objective is to achieve a ā€œmeta-modelā€ that can be applied for metabolic simulations using other ciliates genomic resources (Additional file 1). The impact of CiliateGEM is to allow researchers to predict the response to changes in experimental conditions (e.g. starvation), and consequently to optimise the design of experimental protocols. Although this study focuses on T. thermophila, the findings may be shared with other ciliates as Paramecium and Euplotes species, including Antarctic strains [36]. In the future, we aim to reconstruct a multi-compartment model in symbiosis with algae by incorporating our datasets with an algal genome-scale model [37] (53% of reactions in the CiliateGEM are common in algae). Moreover, the complex systems of ciliates membranous compartments associated to phagocytosis and trafficking is strictly correlated with nutrients uptake and metabolism [38, 39]. We believe that understanding the role of compartments is even more important in multi compartmentalised unicellular organisms than in multicellular organisms (a list of ciliate organelles is reported in Additional file 2: Table S2). Reactions from different ciliates can be integrated in order to build up a hypothesis-driven scalable metabolic model based on common reactions.

Conclusions

In conclusion, our work confirms that Tetrahymena can be used as model organism also for metabolic network reconstruction. Even though Tetrahymena is much less known from the metabolic point of view than others model organisms such as Saccharomyces and C. elegans (including the metabolic network operating during grown at laboratory-controlled conditions), the availability of ā€œomicā€ resources allows reliable modelling of metabolic fluxes. Tools and resources obtained for Tetrahymena can then be applied to other ciliates and can elucidate genotype-phenotype mapping. A key point of this project is to create a collaborative network useful to expand and validate CiliateGEM with in vitro experiments on metabolic reactions already collected inside the model, and ultimately add new reactions. Collaboration between research teams in this type of projects is essential to speed up the process and to join the expertise of different research groups. A consensus modelling approach, already successfully adapted for many metabolic reconstructions including human and yeast, might help researchers to understand the ciliate differences at genera, species, subspecies and populations level. Furthermore, with a combination of shared resources, it will be possible to study full ecosystems (food chain from endosymbionts, associated bacteria, predators-preys). These models, coupled with multi-omic approaches [40], could also highlight aspects of evolutionary biology and biogeography, sympatric and allopatric speciation.

Methods

Tetrahymena model (CiliateGEM) reconstruction

CiliateGEM was created using the protocol described by Thiele and Palsson [13]. First, AUGUSTUS (a gene prediction web server [41]), was used to identify all the Tetrahymena protein-encoding genes from the macronuclear genome assembly and then to assign the functional roles through accurate annotation with EC (Enzyme Commission) number. Next, a list of metabolic reactions was generated from KEGG and converted into a stoichiometric matrix. KEGG was used to search for Tetrahymena pathways and all the reactions were manually assembled. The specific links are listed in Additional file 3. A general list of exchange reactions was obtained from the literature [42]. The reaction list was reduced and adapted to fit Tetrahymena metabolism (Additional file 4). Establishing enzyme-reaction relationship is a complex step because an enzyme can be encoded by one or many genes, and each enzyme can catalyse one or more reactions [43]. We defined the growth media and external factors based on Tetrahymena experimental studies [28,29,30]. This facilitated us in capturing the organism physiological properties. The produced model was then converted into a mathematical one used to simulate the maximal growth with flux-balance analysis (FBA) [44].

FBA requires reactions to be represented as a stoichiometric matrix (S), with dimensions of mā€‰Ć—ā€‰n, where the metabolites (m) are represented as rows and the reactions (n) are represented as columns. The stoichiometric matrix is a numerical matrix of stoichiometric coefficients for each metabolite participating in a reaction. FBA assumes the system to be in a pseudo-steady state Sā€‰ā‹…ā€‰vā€‰=ā€‰0 holding for internal metabolites (reactants and products of the chemical reactions), where the vector v represents the flux distribution. An internal metabolite constitutes the model but cannot be imported or exported directly. An ā€œexchangeā€ counterpart of that metabolite, and an exchange reaction are required for modelling import/export reactions. Exchange metabolites can be imported and exported from the system, so they do not satisfy the steady state assumption. The flux distribution v is a vector of reaction rates and represents a feasible flux of metabolites through the reaction network, where under the principle of mass conservation, the total amount of metabolite consumed and total amount of metabolite produced are equal to zero.

In FBA, directionality and capacity constraints are placed on individual reactions by defining the upper (Vmax) and lower (Vmin) bounds on the range of values that the flux of each reaction can have (Vmin ā‰¤ v ā‰¤ Vmax). These constraints define the space of allowable flux distributions at which every metabolite is consumed or produced by each reaction in the system. Such flux bounds, coupled with gene-protein-reaction association rules, can also be used to map multiple environmental conditions, using binary threshold-based or continuous valve-based approaches [45, 46]. Despite these constraints, the system is still underdetermined (more unknowns than equations) and, as a result, infinite viable solutions exist. A flux distribution is obtained by defining an objective function that is a scalar product of the vector of flux rates v, and a vector of weights c, measuring how each component in the network contributes to the production of a biologically desirable phenotype.

Formally, we adopted the following linear program:

$$ \max\ c\cdotp v,\mathrm{such}\ \mathrm{that} $$
$$ S\cdot v=\dot{x} $$
$$ V\mathit{\min}\le v\le V\mathit{\max} $$
$$ {\dot{x}}_l=0\kern0.24em if\;{M}_i\in \mathrm{internal}\kern0.17em \mathrm{metabolites} $$
$$ {\dot{x}}_l\in R\kern0.24em if\kern0.28em {M}_i\kern0.28em \in \kern0.28em \mathrm{exchange}\kern0.34em \mathrm{metabolites} $$

CiliateGEM, consists of 545 reactions, 828 metabolites, 64 transport step reactions (which represent the import and export of metabolites between extra- and intra-cellular space), and 84 boundary steps (which represent the input and output of metabolites from extracellular space and a biomass reaction). The biomass reaction defines a unique objective function to effective growth. This describes the rate at which all of the biomass precursors are made in the correct proportions. Linear programming was used to calculate the optimal flux distribution that maximizes growth. The model was encoded in SBML format [47] and fulfils MIRIAM requirements [48]. It was then imported in MATLAB (version R2016b). The simulation was carried out using COBRA [49] toolbox with the linear programming solver GLPK. All biomass flux values were given in millimoles/hour/grams of dry weight (mmol/gDW/hr). CiliateGEM is provided as Additional files (Additional file 5: CiliateModel.mat).

Abbreviations

COBRA:

The COnstraint-Based Reconstruction and Analysis Toolbox

FBA:

Flux Balance Analysis

GEM:

Genome-scale models

MMI:

Marine Microbiology Initiative

References

  1. Henry CS, DeJongh M, Best AA, Frybarger PM, Linsay B, Stevens RL. High-throughput generation, optimization and analysis of genome-scale metabolic models. Nat Biotechnol. 2010;28:977ā€“82.

    ArticleĀ  CASĀ  Google ScholarĀ 

  2. McAnulty MJ, Yen JY, Freedman BG, Senger RS. Genome-scale modeling using flux ratio constraints to enable metabolic engineering of clostridial metabolism in silico. BMC Syst Biol. 2012;6:42.

    ArticleĀ  CASĀ  Google ScholarĀ 

  3. Kim B, Kim WJ, Kim DI, Lee SY. Applications of genome-scale metabolic network model in metabolic engineering. J Ind Microbiol Biotechnol. 2015;42:339ā€“48.

    ArticleĀ  CASĀ  Google ScholarĀ 

  4. Folger O, Jerby L, Frezza C, Gottlieb E, Ruppin E, Shlomi T. Predicting selective drug targets in cancer through metabolic networks. Mol Syst Biol. 2011;7:501.

    ArticleĀ  Google ScholarĀ 

  5. Shaked I, Oberhardt MA, Atias N, Sharan R, Ruppin E. Metabolic network prediction of drug side effects. Cell Syst. 2016;2(3):209ā€“13.

    ArticleĀ  CASĀ  Google ScholarĀ 

  6. Eyassu F, Angione C. Modelling pyruvate dehydrogenase under hypoxia and its role in cancer metabolism. R Soc Open Sci. 2017;4(10):170360.

    ArticleĀ  Google ScholarĀ 

  7. Monk JM, Charusanti P, Aziz RK, Lerman JA, Premyodhin N, Orth JD, Feist AM, Palsson BƘ. Genome-scale metabolic reconstructions of multiple Escherichia coli strains highlight strain-specific adaptations to nutritional environments. Proc Natl Acad Sci U S A. 2013;110(50):20338ā€“43.

    ArticleĀ  CASĀ  Google ScholarĀ 

  8. Angione C, LiĆ³ P. Predictive analytics of environmental adaptability in multi-omic network models. Sci Rep. 2015;5:15147.

    ArticleĀ  CASĀ  Google ScholarĀ 

  9. Babaei P, Ghasemi-Kahrizsangi T, Marashi SA. Modeling the differences in biochemical capabilities of pseudomonas species by flux balance analysis: how good are genome-scale metabolic networks at predicting the differences? Sci World J. 2014;2014:416289.

    Google ScholarĀ 

  10. van Heck RG, Ganter M, Martins Dos Santos VA, Stelling J. Efficient Reconstruction of Predictive Consensus Metabolic Network Models. PLoS Comput Biol. 2016;12(8):e1005085.

    ArticleĀ  Google ScholarĀ 

  11. Koehorst JJ, van Dam JC, van Heck RG, Saccenti E, Dos Santos VA, Suarez-Diez M, Schaap PJ. Comparison of 432 Pseudomonas strains through integration of genomic, functional, metabolic and expression data. Sci Rep. 2016;6:38699.

    ArticleĀ  CASĀ  Google ScholarĀ 

  12. Levy R, Borenstein E. Metabolic modeling of species interaction in the human microbiome elucidates community-level assembly rules. Proc Natl Acad Sci U S A. 2013;110(31):12804ā€“9.

    ArticleĀ  CASĀ  Google ScholarĀ 

  13. Thiele I, Palsson BƘ. A protocol for generating a high-quality genome-scale metabolic reconstruction. Nat Protoc. 2010;5(1):93ā€“121.

    ArticleĀ  CASĀ  Google ScholarĀ 

  14. Yilmaz LS, Walhout AJ. A Caenorhabditis elegans genome-scale metabolic network model. Cell Syst. 2016;2(5):297ā€“311.

    ArticleĀ  CASĀ  Google ScholarĀ 

  15. Baa-Puyoulet P, Parisot N, Febvay G, et al. ArthropodaCyc: a CycADS powered collection of BioCyc databases to analyse and compare metabolism of arthropods. Database. 2016;2016. https://doi.org/10.1093/database/baw081.

    ArticleĀ  Google ScholarĀ 

  16. Vijayakumar S, Conway M, LiĆ³ P, Angione C. Seeing the wood for the trees: a forest of methods for optimization and omic-network integration in metabolic modelling. Brief Bioinform. 2017. https://doi.org/10.1093/bib/bbx053.

  17. Taffi M, Paoletti N, Angione C, Pucciarelli S, Marini M, LiĆ² P. Bioremediation in marine ecosystems: a computational study combining ecological modeling and flux balance analysis. Front Genet. 2014;5:319ā€“25.

    ArticleĀ  Google ScholarĀ 

  18. Taffi M, Paoletti N, LiĆ² P, Pucciarelli S, Marini M. Bioaccumulation modelling and sensitivity analysis for discovering key players in contaminated food webs: the case study of PCBs in the adriatic sea. Ecol Model. 2015;306:205ā€“15.

    ArticleĀ  CASĀ  Google ScholarĀ 

  19. Dahal S, Poudel S, Thompson RA. Genome-scale modeling of thermophilic microorganisms. Adv Biochem Eng Biotechnol. 2016;160:103ā€“19.

    Google ScholarĀ 

  20. MagnĆŗsdĆ³ttir S, Heinken A, Kutt L, Ravcheev DA, Bauer E, Noronha A, Greenhalgh K, JƤger C, Baginska J, Wilmes P, Fleming RM, Thiele I. Generation of genome-scale metabolic reconstructions for 773 members of the human gut microbiota. Nat Biotechnol. 2017;35(1):81ā€“9.

    ArticleĀ  Google ScholarĀ 

  21. Heavner BD, Price ND. Comparative Analysis of Yeast Metabolic Network Models Highlights Progress, Opportunities for Metabolic Reconstruction. PLoS Comput Biol. 2015;11(11):e1004530.

    ArticleĀ  Google ScholarĀ 

  22. Marine Microbiology Initiative (MMI): //https://www.imicrobe.us/. Accessed 7 Aug 2018.

  23. Bright LJ, Kambesis N, Nelson SB, Jeong B, Turkewitz AP. Comprehensive analysis reveals dynamic and evolutionary plasticity of Rab GTPases and membrane traffic in Tetrahymena thermophila. PLoS Genet. 2010;6(10):e1001155.

    ArticleĀ  Google ScholarĀ 

  24. Warren A, Patterson DJ, Dunthorn M, et al. Beyond the ā€œcodeā€: a guide to the description and documentation of biodiversity in ciliated Protists (Alveolata, Ciliophora). J Eukaryot Microbiol. 2017;64:539ā€“54.

    ArticleĀ  Google ScholarĀ 

  25. Hamilton EP, Kapusta A, Huvos PE, et al. Structure of the germline genome of Tetrahymena thermophila and relationship to the massively rearranged somatic genome. elife. 2016;5:e19090.

    ArticleĀ  Google ScholarĀ 

  26. Eisen JA, Coyne RS, Wu M, Wu D, Thiagarajan M, Wortman JR, Delcher AL. Macronuclear genome sequence of the ciliate Tetrahymena thermophila, a model eukaryote. PLoS Biol. 2006;4(9):e286.

    ArticleĀ  Google ScholarĀ 

  27. Heirendt L, Arreckx S, Pfau T, Mendoza SN, Richelle A, Heinken A, Haraldsdottir HS, Keating SM, Vlasov V, Wachowiak J, Magnusdottir S. Creation and analysis of biochemical constraint-based models: the COBRA Toolbox v3. 0. arXiv preprint arXiv:1710.04038. 2017.

    Google ScholarĀ 

  28. Raugi GJ, Liang T, Blum JJ. A quantitative analysis of metabolite fluxes along some of the pathways of intermediary metabolism in Tetrahymena pyriformis. J Biol Chem. 1975;250(15):5866ā€“76.

    CASĀ  PubMedĀ  Google ScholarĀ 

  29. Borowitz MJ, Stein RB, Blum JJ. Quantitative analysis of the change of metabolite fluxes along the pentose phosphate and glycolytic pathways in Tetrahymena in response to carbohydrates. J Biol Chem. 1977 Mar 10;252(5):1589ā€“605.

    CASĀ  PubMedĀ  Google ScholarĀ 

  30. Steint B, Blum JJ. Quantitative analysis of intermediary metabolism in Tetrahymnena. J Biol Chem. 1980;255(9):4198ā€“205.

    Google ScholarĀ 

  31. Jeon SJ, Ishikawa K. A novel ADP-dependent DNA ligase from Aeropyrum pernix K1. FEBS Lett. 2003;550(1ā€“3):69ā€“73.

    ArticleĀ  CASĀ  Google ScholarĀ 

  32. Gibbons IR, Rowe AJ. Dynein: a protein with adenosine triphosphatase activity from cilia. Science. 1965;149:424ā€“6.

    ArticleĀ  CASĀ  Google ScholarĀ 

  33. Lobanov AV, Heaphy SM, Turanov AA, Gerashchenko MV, Pucciarelli S, Devaraj RR, Xie F, Petyuk VA, Smith RD, Klobutcher LA, Atkins JF, Miceli C, Hatfield DL, Baranov PV, Gladyshev VN. Position-dependent termination and widespread obligatory frameshifting in Euplotes translation. Nat Struct Mol Biol. 2017;24(1):61ā€“8.

    ArticleĀ  CASĀ  Google ScholarĀ 

  34. Wang J, Davis RE. Programmed DNA elimination in multicellular organisms. Curr Opin Genet Dev. 2014;27:26ā€“34.

    ArticleĀ  CASĀ  Google ScholarĀ 

  35. Nowacki M, Landweber LF. Epigenetic inheritance in ciliates. Curr Opin Microbiol. 2009;12(6):638ā€“43. https://doi.org/10.1016/j.mib.2009.09.012.

    ArticleĀ  CASĀ  PubMedĀ  PubMed CentralĀ  Google ScholarĀ 

  36. Pucciarelli S, La Terza A, Ballarini P, Barchetta S, Yu T, Marziale F, Passini V, Methe B, Detrich HW III, Miceli C. Molecular cold-adaptation of protein function and gene regulation: the case for comparative genomic analyses in marine ciliated protozoa. Mar Genomics. 2009;2:57ā€“66.

    ArticleĀ  Google ScholarĀ 

  37. Dalā€™Molin CG, Quek LE, Palfreyman RW, Nielsen LK. AlgaGEM ā€“ a genome-scale metabolic reconstruction of algae based on the Chlamydomonas reinhardtii genome. BMC Genomics. 2011;12(Suppl 4):55.

    Google ScholarĀ 

  38. Nusblat AD, Bright LJ, Turkewitz AP. Conservation and innovation in Tetrahymena membrane traffic: proteins, lipids, and compartments. Methods Cell Biol. 2012;109:141.

    ArticleĀ  Google ScholarĀ 

  39. Guerrier S, Plattner H, Richardson E, Dacks JB, Turkewitz AP. An evolutionary balance: conservation vs innovation in ciliate membrane trafficking. Traffic. 2017;18(1):18ā€“28.

    ArticleĀ  CASĀ  Google ScholarĀ 

  40. Angione C, LiĆ² P, Pucciarelli S, Can B, Conway M, et al. 2016. Bioinformatics Challenges and Potentialities in Studying Extreme Environments, Vol. 9874 LNCS.

    Google ScholarĀ 

  41. Stanke M, Morgenstern B. AUGUSTUS: a web server for gene prediction in eukaryotes that allows user-defined constraints. Nucleic Acids Res. 2005;33:465ā€“7.

    ArticleĀ  Google ScholarĀ 

  42. Nogales J, Palsson BO. Thiele Ines. A genome-scale metabolic reconstruction of Pseudomonas putida KT2440: i JN746 as a cell factory. BMC Syst Biol. 2008;2:1ā€“79.

    ArticleĀ  Google ScholarĀ 

  43. Poolman MG, Bonde BK, Gevorgyan A, Patel HH, Fell DA. Challenges to be faced in the reconstruction of metabolic networks from public databases. Syst Biol. 2006;153(5):379ā€“84.

    ArticleĀ  CASĀ  Google ScholarĀ 

  44. Orth JD, Thiele I, Palsson BƘ. What is flux balance analysis? Nat Biotechnol. 2010;28(3):245ā€“8.

    ArticleĀ  CASĀ  Google ScholarĀ 

  45. Angione C. Integrating splice-isoform expression into genome-scale models characterizes breast cancer metabolism. Bioinformatics. 2018;34(3):494-501.

    ArticleĀ  Google ScholarĀ 

  46. Opdam S, Richelle A, Kellman B, Li S, Zielinski DC, Lewis NE. A Systematic Evaluation of Methods for Tailoring Genome-Scale Metabolic Models. Cell Syst. 2017;4(3):318ā€“29.

    ArticleĀ  CASĀ  Google ScholarĀ 

  47. Hucka M, Bergmann FT, Hoops S, Keating SM, Sahle S, Schaff JC, Smith LP, Wilkinson DJ. The systems biology markup language (SBML): language specification for level 3 version 1 Core. J Integr Bioinform. 2015;12(2):266.

    PubMedĀ  PubMed CentralĀ  Google ScholarĀ 

  48. Laibe C, Le NovĆØre N. MIRIAM resources: tools to generate and resolve robust cross-references in systems biology. BMC Syst Biol. 2007;1:58.

    ArticleĀ  Google ScholarĀ 

  49. Schellenberger J, Que R, Fleming RM, Thiele I, Orth JD, Feist AM, Zielinski DC, Bordbar A, Lewis NE, Rahmanian S, Kang J, Hyduke DR, Palsson BƘ. Quantitative prediction of cellular metabolism with constraint-based models: the COBRA toolbox v2.0. Nat Protoc. 2011;6(9):1290ā€“307.

    ArticleĀ  CASĀ  Google ScholarĀ 

  50. Stover NA, Krieger CJ, Binkley G, Dong Q, Fisk DG, Nash R, Sethuraman A, Weng S, Cherry JM. Tetrahymena genome database (TGD): a new genomic resource for Tetrahymena thermophila research. Nucleic Acids Res. 2006;34:D500ā€“3.

    ArticleĀ  CASĀ  Google ScholarĀ 

  51. Arnaiz O, Cain S, Cohen J, Sperling L. ParameciumDB: a community resource that integrates the Paramecium tetraurelia genome sequence with genetic data. Nucleic Acids Res. 2007 Jan;35(Database issue):D439ā€“44.

    ArticleĀ  CASĀ  Google ScholarĀ 

  52. Arnaiz O, Sperling L. ParameciumDB in 2011: new tools and new data for functional and comparative genomics of the model ciliate Paramecium tetraurelia. Nucleic Acids Res. 2011 Jan;39(Database issue):D632ā€“6.

    ArticleĀ  CASĀ  Google ScholarĀ 

  53. Swart EC, Bracht JR, Magrini V, Minx P, Chen X, Zhou Y, Khurana JS, Goldman AD, Nowacki M, Schotanus K, Jung S, Fulton RS, Ly A, McGrath S, Haub K, Wiggins JL, Storton D, Matese JC, Parsons L, Chang WJ, Bowen MS, Stover NA, Jones TA, Eddy SR, Herrick GA, Doak TG, Wilson RK, Mardis ER, Landweber LF. The Oxytricha trifallax macronuclear genome: a complex eukaryotic genome with 16,000 tiny chromosomes. PLoS Biol. 2013;11(1):e1001473.

    ArticleĀ  CASĀ  Google ScholarĀ 

  54. Burns J, Kukushkin D, Lindblad K, Chen X, Jonoska N, Landweber LF. A database of ciliate genome rearrangements. Nucleic Acids Res. 2016;44(D1):D703ā€“9. https://doi.org/10.1093/nar/gkv1190.

    ArticleĀ  CASĀ  PubMedĀ  Google ScholarĀ 

Download references

Acknowledgements

We are grateful to referees for the suggestions they provided to improve the paper during the revision process. We thank the European Commission Marie Sklodowska-Curie Actions for supporting the project.

Funding

Publication costs for this manuscript were sponsored by the European Commission Marie Sklodowska-Curie Actions H2020 RISE Metable ā€“ 645693 grant.

Availability of data and materials

All the material can be provided by the Corresponding Author on reasonable request and will be soon published.

About this supplement

This article has been published as part of BMC Bioinformatics Volume 19 Supplement 15, 2018: Proceedings of the 12th International BBCC conference. The full contents of the supplement are available online at https://bmcbioinformatics.biomedcentral.com/articles/supplements/volume-19-supplement-15.

Author information

Authors and Affiliations

Authors

Contributions

AM made the curated list of biochemical reactions and figures. FE, AO, CA and MC made the flux balance analysis. AM, FE and MC wrote and reviewed the manuscript. SP and PL conceived the project and supervised all the stages of the research and manuscript drafting. All authors have read and approved the final manuscript.

Corresponding author

Correspondence to Sandra Pucciarelli.

Ethics declarations

Ethics approval and consent to participate

N/A.

Consent for publication

N/A.

Competing interests

The authors declare that they have no competing interests.

Publisherā€™s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Additional files

Additional file 1:

Databases and bioinformatics resources for ciliates [50,51,52,53,54]. (DOCX 14Ā kb)

Additional file 2:

List of organelles found in most ciliates. (DOCX 13Ā kb)

Additional file 3:

Links of the KEGG reactions. (XLSX 11Ā kb)

Additional file 4:

List of reactions. (XLSX 98Ā kb)

Additional file 5:

Simulation file. (XLSX 226Ā kb)

Additional file 6:

SBML version of the model. (XML 716Ā kb)

Additional file 7:

Matlab version of the model. (MAT 48Ā kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Mancini, A., Eyassu, F., Conway, M. et al. CiliateGEM: an open-project and a tool for predictions of ciliate metabolic variations and experimental condition design. BMC Bioinformatics 19 (Suppl 15), 442 (2018). https://doi.org/10.1186/s12859-018-2422-9

Download citation

  • Published:

  • DOI: https://doi.org/10.1186/s12859-018-2422-9

Keywords