- Software
- Open access
- Published:
The BioCyc Metabolic Network Explorer
BMC Bioinformatics volume 22, Article number: 208 (2021)
Abstract
Background
The Metabolic Network Explorer is a new addition to the BioCyc.org website and the Pathway Tools software suite that supports the interactive exploration of metabolic networks. Any metabolic network visualization tool must by necessity show only a subset of all possible metabolite connections, or the results will be visually overwhelming. Existing tools, even those that purport to show an organism’s full metabolic network, limit the set of displayed connections based on predefined pathways or other preselected criteria. We sought instead to provide a tool that would give the user dynamic control over which connections to follow.
Results
The Metabolic Network Explorer is an easy-to-use, web-based software tool that allows the user to specify a starting metabolite of interest and interactively explore its immediate metabolic neighborhood in either or both directions to any desired depth, letting the user select from the full set of connected reactions. Although, as for other tools, only a small portion of the metabolic network is visible at a time, that portion is selected by the user, based on the full reaction complement, and it is easy to switch among alternate paths of interest. The display is intuitive, customizable, and provides copious links to more detailed information pages.
Conclusions
The Metabolic Network Explorer fills a gap in the set of metabolic network visualization tools and complements other modes of exploration. Its primary strengths are its ease of use, diagrams that are intuitive to biologists, and its integration with the broader corpus of data provided by a BioCyc Pathway/Genome Database.
Background
Metabolic networks are large and complex graphs that can be daunting to comprehend. Past research has provided multiple modes for exploring metabolic networks. Highly localized modes include viewing single reactions, single metabolites and the reactions they participate in [1, 4, 6], and viewing predefined metabolic pathways, either singly or in connected groupings [1, 3,4,5,6, 11]. These approaches lack larger network context. Global modes of exploration include visualization of entire metabolic networks [9, 10, 12], or more precisely, of all metabolites combined with a limited set of reaction edges because including all reactions typically produces an incomprehensible tangle.
The Metabolic Network Explorer fills a gap in the existing modes of exploration, enabling the user to begin at a starting metabolite and interactively explore the metabolic neighborhood around that starting point by moving forward and backward in the network and building up a linear path of arbitrary length. The Explorer enables exploration of connected sets of reactions without the use of pre-defined pathways.
The Metabolic Network Explorer is a new addition to the BioCyc.org website and the Pathway Tools software suite. BioCyc [7] is a collection of Pathway/Genome Databases (PGDBs) for over 18,000 organisms, including EcoCyc, which describes the genes, metabolism, and other functions of E. coli K-12 MG1655, and MetaCyc, a database of metabolic reactions and enzymes from all branches of life. The BioCyc website is powered by Pathway Tools [8], a software environment for generating, maintaining, analyzing, visualizing, and web-publishing PGDBs. To access the Metabolic Network Explorer, go to https://BioCyc.org (or any other Pathway Tools website) and select Metabolic Network Explorer from the Metabolism menu. When an organism PGDB is selected, this tool will access the metabolic network of that organism. When MetaCyc is the selected PGDB, this tool accesses metabolites and reactions from all domains of life for applications such as metabolic engineering. Access to EcoCyc and MetaCyc is free; access to other PGDBs beyond a limited number of free monthly pageviews requires a paid subscription. Alternatively, users can install Pathway Tools locally, build a PGDB for any organism of interest from its annotated genome, and use the Metabolic Network Explorer to interactively explore its predicted metabolic network.
We are not aware of other tools that permit the sort of neighborhood-based exploration provided by the Metabolic Network Explorer. Metabolic databases such as BioCyc [7], KEGG [6], Reactome [3], BRENDA [4], and SMPDB [5] provide localized exploration such as visualization of individual reactions and pathways. Tools for visualizing global metabolic networks include offerings from BioCyc [10], KEGG [9], and Reactome [12]. MetExploreViz [2] is a web tool that supports visualization of arbitrary metabolic networks, laid out with a force-directed layout algorithm, but when networks are very large, these diagrams become very difficult to navigate and understand.
Implementation
The Metabolic Network Explorer is a web application with its display generated entirely using HTML, JavaScript and CSS. The primary data structure is a simple linear path consisting of a list of steps, such that each step consists of a metabolite and an optional successor reaction that links the metabolite to the next step. A second data structure records the metabolic neighborhood of every metabolite that has been retrieved from the database. The metabolic neighborhood of a metabolite M is a JavaScript object that includes a near-complete list of possible precursor and successor metabolites based on the set of reactions in the database. (A small number of ubiquitous inorganic molecules, such as water, H+, and phosphate, are omitted from lists of predecessor and successor metabolites.) A precursor metabolite is any metabolite P such that a reaction exists with P as a reactant and M as a product. A successor metabolite is any metabolite S such that a reaction exists with M as a reactant and S as a product. Some reactions are reversible or of unspecified directionality; in such cases, a metabolite could be either a predecessor or a successor, so is included in both categories. For every neighbor metabolite, we record the list of reactions that link the neighbor to M. The data for each reaction includes its full reaction equation, its EC number, the list of enzymes and genes that catalyze the reaction, and any pathways in which the reaction participates.
The user specifies a starting metabolite, and a query is issued to BioCyc or another Pathway Tools server to retrieve the immediate metabolic neighborhood of the selected starting metabolite. The web browser displays the starting metabolite with the list of possible precursors to the left, and the list of possible successors to the right. The user can mouse over any precursor or successor metabolite to see information about the linking reaction(s). The user can also click the plus icon next to any metabolite to add that metabolite to the path. If the metabolite to be added is a precursor to the first metabolite in the path or a successor to the last metabolite in the path, the path is extended in that direction. If the selected metabolite is connected to an intermediate metabolite on the current path, it will replace the existing portion of the path connected to the intermediate metabolite. Thus, only a single linear path (plus all the predecessors and successors for all metabolites on that linear path) can be displayed at a time. When a portion of a path is replaced, the original path is saved to a list of previous paths. The user can easily switch back and forth among previous paths. A path containing more than one metabolite is displayed as a vertical linear pathway diagram, with each reaction optionally including side metabolites, enzymes and genes, EC numbers, compound structures, and links to pathways.
Results
Figure 1 shows a short path that was generated by starting from the metabolite succinate in EcoCyc and interactively expanding backward two steps to 2-oxoglutaramate (top of figure) and forward one step to O-succinyl-l-homoserine (bottom). Other paths were explored along the way; they are listed in the Previous Paths section in the panel at right and can be restored by clicking on them. The display is information-rich, yet easy for a biologist to quickly comprehend. All the potential predecessor or successor metabolites of a given metabolite are collected into a single list (scrollable if the number is large, as is the case for predecessors and successors of oxoglutarate in the figure) displayed to the left or right of each central metabolite, minimizing diagram clutter. Color-coding on the plus icons indicates whether the reaction to and/or from that metabolite is unidirectional, potentially bidirectional, spontaneous, or if there are multiple reactions. When the path includes two metabolites connected by multiple reactions, as in the case of the middle two metabolites in the figure, the number of reactions is listed, and the user can select which is displayed. The diagram is customizable, allowing the user to select which display elements should be visible. Figure 2 shows a second example path in which the display settings have been changed to show compound structures and to suppress display of all predecessor and successor candidates except those at the start and end of the path. Nearly everything in the diagram is clickable. Clicking on a metabolite in the main path, a reaction arrow, or an enzyme, gene or pathway name will open the detailed data page for that object in a separate browser tab. Clicking on a metabolite in the predecessor or successor lists brings up a small panel containing hyperlinks to the metabolite and all reactions, enzymes, genes and pathways that connect to that metabolite, so that the user can examine them in more detail before deciding whether to extend the path to that metabolite. The “Generate Link” button in the right panel generates a URL to the Metabolic Network Explorer preset with the currently displayed path, so that a path can be easily saved or shared.
The result is a tool that enables easy, interactive browsing of a portion of the metabolic network. Unlike other more general network visualization tools, no prior preselection or preloading of a sub-network is required beyond the initial selection of the database of interest. Limiting the tool to showing only a single linear path at a time allows for a compact, easy-to-read, aesthetically appealing display, but full predecessor and successor metabolite lists visible for every metabolite in the selected path make it easy to identify and explore alternate paths. Ready access to detailed information pages for the reactions, pathways and enzymes associated with the current path provides important context.
Conclusions
The Metabolic Network Explorer is a new web tool that enables interactive generation and display of small metabolic subnetworks of interest without being limited to predefined pathways. Its primary strengths are its ease of use, diagrams that are intuitive to biologists, and its integration with the broader corpus of data provided by a PGDB. It complements other tools for interacting with metabolic networks and fills an important gap in the set of metabolic network visualization tools.
Availability of data and materials
The data used by this tool are part of the BioCyc data collection, available at https://BioCyc.org. Access to EcoCyc and MetaCyc are free. Access to data for other organisms requires a subscription after a period of free use. The Pathway Tools software is free to academic users with a signed license agreement, and is available for a fee to commercial users. EcoCyc and MetaCyc data are bundled with the software, with data for other organisms requiring a subscription. To obtain the software, see https://BioCyc.org/download.shtml. Source code is available to licensees upon request.
Abbreviations
- PGDB:
-
Pathway/Genome Database
References
Caspi R, Billington R, Fulcher CA, Keseler IM, Kothari A, Krummenacker M, Latendresse M, Midford PE, Ong Q, Ong WK, Paley S, Subhraveti P, Karp PD. The MetaCyc database of metabolic pathways and enzymes. Nucl Acids Res. 2018;46(D1):D633-39.
Chazalviel M, Frainay C, Poupin N, Vinson F, Merlet B, Gloaguen Y, Cottret L, Jourdan F. MetExploreViz: web component for interactive metabolic network visualization. Bioinformatics. 2018;34(2):312–3.
Jassal B, Matthews L, Viteri G, Gong C, Lorente P, Fabregat A, Sidiropoulos K, Cook J, Gillespie M, Haw R, Loney F, May B, Milacic M, Rothfels K, Sevilla C, Shamovsky V, Shorser S, Varusai T, Weiser J, Wu G, Stein L, Hermjakob H, D’Eustachio P. The Reactome pathway knowledgebase. Nucl Acids Res. 2020;48(D1):D498–503.
Jeske L, Placzek S, Schomburg I, Chang A, Schomburg D. BRENDA in 2019: a European ELIXIR core data resource. Nucl Acids Res. 2019;47(Database issue):D542-49.
Jewison T, Su Y, Disfany FM, Liang Y, Knox C, Maciejewski A, Poelzer J, Huynh J, Zhou Y, Arndt D, Djoumbou Y, Liu Y, Deng L, Guo AC, Han B, Pon A, Wilson M, Rafatnia S, Liu P, Wishart DS. SMPDB 2.0: big improvements to the small molecule pathway database. Nucl Acids Res. 2014;42(Database issue):D478-84.
Kanehisa M, Goto S, Sato Y, Kawashima M, Furumichi M, Tanabe M. Data, information, knowledge and principle: back to metabolism in KEGG. Nucl Acids Res. 2014;42(Database issue):D199–205.
Karp PD, Billington R, Caspi R, Fulcher CA, Latendresse M, Kothari A, Keseler IM, Krummenacker M, Midford PE, Ong Q, Ong WK, Paley SM, Subhraveti P. The BioCyc collection of microbial genomes and metabolic pathways. Brief Bioinform. 2017;20:1085–93. https://doi.org/10.1093/bib/bbx085.
Karp PD, Midford PE, Paley SM, Krummenacker M, Billington R, Kothari A, Ong WK, Subhraveti P, Keseler IM, Caspi R. Pathway tools version 24.0: integrated software for pathway/genome informatics and systems biology [v4]. 2020. p. 1–98. arXiv arXiv:1510.03964v4
KEGG Metabolic Chart. https://www.kegg.jp/kegg-bin/show_pathway?map01100.
Paley S, Billington R, Herson J, Krummenacker M, Karp PD. Pathway tools visualization of organism-scale metabolic networks. Metabolites. 2021;11(2):64.
Paley S, O’Maille PE, Weaver D, Karp PD. Pathway collages: personalized multi-pathway diagrams. BMC Bioinform. 2016;17(1):529–38.
Reactome pathway browser. https://reactome.org/PathwayBrowser.
Acknowledgements
Not applicable.
Funding
Research reported in this publication was supported by the National Institute of General Medical Sciences of the National Institutes of Health under Award Number GM080746. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health. The NIH did not play any role in the design of the study; nor in collection, analysis, or interpretation of data; nor in writing the manuscript.
Author information
Authors and Affiliations
Contributions
SP designed and implemented the software and wrote the bulk of the manuscript. PK conceived of the tool and provided guidance and oversight, and contributed to the manuscript. Both authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Ethics approval and consent to participate
Not applicable.
Consent to publish
Not applicable.
Competing interests
The authors declare that they have no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Paley, S., Karp, P.D. The BioCyc Metabolic Network Explorer. BMC Bioinformatics 22, 208 (2021). https://doi.org/10.1186/s12859-021-04132-5
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s12859-021-04132-5