CySBGN: A Cytoscape plug-in to integrate SBGN maps
© Gonçalves et al; licensee BioMed Central Ltd. 2013
Received: 11 September 2012
Accepted: 27 December 2012
Published: 16 January 2013
Skip to main content
© Gonçalves et al; licensee BioMed Central Ltd. 2013
Received: 11 September 2012
Accepted: 27 December 2012
Published: 16 January 2013
A standard graphical notation is essential to facilitate exchange of network representations of biological processes. Towards this end, the Systems Biology Graphical Notation (SBGN) has been proposed, and it is already supported by a number of tools. However, support for SBGN in Cytoscape, one of the most widely used platforms in biology to visualise and analyse networks, is limited, and in particular it is not possible to import SBGN diagrams.
We have developed CySBGN, a Cytoscape plug-in that extends the use of Cytoscape visualisation and analysis features to SBGN maps. CySBGN adds support for Cytoscape users to visualize any of the three complementary SBGN languages: Process Description, Entity Relationship, and Activity Flow. The interoperability with other tools (CySBML plug-in and Systems Biology Format Converter) was also established allowing an automated generation of SBGN diagrams based on previously imported SBML models. The plug-in was tested using a suite of 53 different test cases that covers almost all possible entities, shapes, and connections. A rendering comparison with other tools that support SBGN was performed. To illustrate the interoperability with other Cytoscape functionalities, we present two analysis examples, shortest path calculation, and motif identification in a metabolic network.
CySBGN imports, modifies and analyzes SBGN diagrams in Cytoscape, and thus allows the application of the large palette of tools and plug-ins in this platform to networks and pathways in SBGN format.
The illustration of information in biology is frequently made using diagrams. Therefore, having a standard notation is very beneficial as it facilitates interpretation and forgoes the need for long and detailed explanations . Motivated by this, a large community of biochemists, modellers and computer scientists proposed the Systems Biology Graphical Notation (SBGN) . SBGN is an established effort for a standard graphical notation in biology. It leverages previous efforts, including Molecular Interactions Maps [2, 3] and Kitano process diagrams [4, 5], and extracts the best of graphical standard notations in other fields, such as Unified Modelling Language in software engineering.
SBGN is subdivided into three different and complementary languages: Process Description (PD), Entity Relationship (ER) and Activity Flow (AF). This subdivision allows SBGN to be accurate and assures an unambiguous representation . PD focuses on the representation of temporal changes occurring between biochemical entities. All molecular processes and interactions and their results are depicted in PD diagrams. In contrast, the ER language depicts the effects among entities, emphasizing the influences that each entity has on each other and disregarding the transformations on itself (which are dealt with in PD diagrams). Finally, AF graphically shows the influence of activities such as inhibition or activation. To better represent these influences the diagram is simplified by ignoring biochemical details of the process and entity states, reducing the number of nodes when compared to an equivalent PD. These three languages together enable the users to represent all types of biological information from biochemical interaction maps to cellular signalling networks.
A graphical notation can only be shared and analysed if it is supported by network visualization tools. SBGN is currently supported by a list of 24 software packages (http://www.sbgn.org/), including CellDesigner , PathVisio  and VANTED [8, 9]. Cytoscape [10, 11], arguably the most popular tool in bioinformatics for visualization and analysis of biological networks, has support for the Systems Biology Markup Language (SBML)  natively and via the CySBML plug-in , and BioPAX  via the BiNoM plug-in . However, it currently has no support for SBGN. The ability to import SBGN diagrams into Cytoscape would be very beneficial, as it would enable the use of all the network analysis features of Cytoscape and its plug-ins in SBGN maps. These include the analysis of the network properties (e.g. shortest paths, motif discovery), visualization of expression data, and pathway modelling.
Motivated by this, we have developed CySBGN, a new plug-in for Cytoscape that enables the full support of SBGN diagrams. CySBGN allows one to import and visualize SBGN diagrams stored in SBGN-ML format using the libSBGN library . It is also possible to export small changes made in the SBGN diagram, such as node positions. Full integration with CySBML  and Systems Biology Format Converter  is available, allowing the user to automatically generate SBGN diagrams from SBML models. To validate the rendering of the diagrams with the plug-in, we tested it with a suite of SBGN diagrams that covers the three sub languages as well as all its shapes and connections. The plug-in applicability is also demonstrated by applying network analysis methods, in particular shortest path and motif discovery, from two different plug-ins in a PD diagram of the central plant metabolism.
CySBGN enables the importation of SBGN diagrams into Cytoscape and it is compatible with the latest version of Cytoscape, version 2.8.3. All the entity and relationship nodes shapes defined in SBGN specifications for each language are supported.
All the information from the SBGN-ML file used to create and display the SBGN diagram is stored as node and edge attributes. Therefore, information such as node type (e.g. phenotype, macromolecule), node width and edge type (e.g. stimulation, necessary-stimulation) is available to the user through the Data Panel in Cytoscape.
CySBGN stores in the SBGN-ML files the coordinates of all entities in the diagram, allowing the user to export a defined layout. Thus, one can use CySBGN to take advantage of the several Cytoscape automatic layout algorithms and then share the generated layout among other SBGN compliant tools.
Cytoscape offers nine different node shapes, and therefore it is not possible to represent all the SBGN entities based on this list alone (Figure 2a). To overcome this we use the custom node graphics feature in Cytoscape (introduced in Cytoscape 2.8 ), which renders any specific image above the nodes. Thus, CySBGN encompasses all the SBGN shapes by loading a previously created image in the respective type of node.
In order to increase the rendering performance Cytoscape disables the rendering order of the nodes. In other words, nodes do not have a z coordinate, and therefore when two nodes are overlapping each other it is not possible to identify which node is in the front or back. Consequently, nodes are randomly ordered (see Figure 2c). This brings some representation problems mostly when the SBGN diagrams contain nodes that enclose other nodes, as is the case of compartments. To avoid rendering the compartment above the enclosed nodes and therefore hiding them, CySBGN draws every compartment node with transparent filling. This ensures that the enclosed nodes are visible when the compartment is not selected. However, it is not possible to guarantee the transparency of a node when it is selected.
Implemented methods in Cytoscape are optimized to work with simple network structures similar to SIF files. Hence, we created a simplification method that becomes useful in cases when the SBGN diagram structure does not fit the Cytoscape features, such as applying a layout algorithm to a SBGN diagram with cloned entities. In SBGN, a clone marker (dark shading in the bottom part of the entities) identifies nodes that appear duplicated in the network. Typical examples of clone entities are ADP or ATP, because they are present in more than one reaction. In these cases the hierarchical layout is wrongly affected by the presence of the cloned entities. Another example is when a necessary stimulation arc is present: due to its unsupported shape CySBGN represents it as an aggregation of two edges (see Results and discussion section for more details). This composition of edges affects the layout analysis of the network. The simplification feature duplicates the original diagram but disregards all nodes that are not supported by Cytoscape, such as compartments, tags or cloned entities. In addition, the edges that need to be drawn as a composition of other edges (i.e necessary stimulation, absolute inhibition and absolute stimulation) are simplified into a single edge (Figure 2b). Using the simplification feature the diagram information is retained, since the edge type attribute is preserved and only the edge shape is changed. Moreover, the simplification is represented in another diagram keeping the original one unchanged. In the CySBGN tutorial (http://www.ebi.ac.uk/saezrodriguez/cysbgn/files/tutorial.pdf) an example of a simplification of a map kinase cascade diagram is provided.
To increase the interoperability of CySBGN we developed an integrated feature between CySBGN and CySBML  that enables the users to automatically generate a SBGN diagram based on a SBML model. Taking advantage of the features made available by the SBFC , CySBGN generates a SBGN-ML file from a previously selected SBML model and consequently imports it into Cytoscape. This allows the SBML model visualization to be complemented with the respective SBGN diagram, providing the users the possibility to generate the most convenient visualisation of the model, store it, and share it among users and tools. The CySBGN tutorial (http://www.ebi.ac.uk/saezrodriguez/cysbgn/files/tutorial.pdf) provides a step-by-step guide of this feature, it also provides information on how to obtain SBML models from the BioModels database .
The accuracy of diagram rendering in CySBGN was validated by running a varied and extensive number of test cases containing maps from the three complementary languages of SBGN. For each case, we imported a SBGN-ML file and compared the generated diagram with the expected layout. The AF language was tested with 8 diagrams, the ER with 18 and the PD with 27, leading to a total of 53 different maps, covering almost all shapes and connection types. This suite of test cases is shared among other SBGN compliant tools, such as PathVisio  or SBGN-ED  (a VANTED [8, 9] plug-in which creates and edits SBGN diagrams). Hence, it is possible to compare the results obtained by CySBGN with the original (expected) diagrams and the ones obtained in other tools.
CySBML allows users to import SBML models directly from BioModels database , visualize and analyse them. We used this feature to test the automatic generation of SBGN diagrams from the SBML models. This allows one to pick a pathway from BioModels, download the respective SBML model and then generate the SBGN diagram that can be further used as a publication diagram or simply shared with other users. The plug-in’s tutorial is available on the CySBGN web page (http://www.ebi.ac.uk/saezrodriguez/cysbgn/files/tutorial.pdf) and provides detailed step-by-step guides to all CySBGN features. Additionally, the plug-in’s applicability is illustrated by using two different Cytoscape plug-ins to apply network analysis methods to SBGN diagrams. The first example shows the identification of the shortest path between pyruvate and succinate in a large metabolic network using the CytoHubba plug-in (http://hub.iis.sinica.edu.tw/cytoHubba/). The second example demonstrates how users can identify network motifs using the NetMatch  plug-in. Both examples use a large SBGN diagram of the plant central metabolism (1324 nodes and 1322 edges).
Future steps in CySBGN will focus on: (i) further integration with other Cytoscape plug-ins (e.g. BiNoM  integrates a wide variety of structure analysis methods and ways to convert the CellDesigner SBML extension to BioPAX and BioPAX to SBML), and (ii) improving the diagram export feature to allow a full mapping of the changes made to the SBGN diagram (i.e. allow the user to add/remove entities and arcs and also allow the storage of any other modifications made to the diagram). Furthermore, we will upgrade CySBGN compatibility to the upcoming version of Cytoscape, Cytoscape 3, when it is released.
Here we presented a new plug-in for Cytoscape, CySBGN, that provides support to SBGN diagrams. CySBGN allows one to load and visualize SBGN-ML models in Cytoscape and leverage all its repertoire of features and plug-ins. Among other features it also establishes a connection with the CySBML plug-in and SBFC, allowing the automated generation of SBGN diagrams of any imported SBML model. A detailed tutorial containing several step-by-step guides covering all CySBGN features is made available in the plug-in web page (http://www.ebi.ac.uk/saezrodriguez/cysbgn/files/tutorial.pdf).
Cytoscape’s active community allows the development of the plug-ins to be processed smoothly. Thus, features like extending Cytoscape’s supported edges and nodes shapes may be added in near future, consequently improving CySBGN. Moreover, there is an increasing number of plug-ins available , thus we expect more synergies of CySBGN with other plug-ins in the future.
Project name: CySBGN
Project home page: http://www.ebi.ac.uk/saezrodriguez/cysbgn/
Operating system(s): Platform independent
Programming language: Java
Other requirements: Java 1.6 or higher, Cytoscape 2.8.3
License: GNU GPL v3
Any restrictions use by non-academics: Only those imposed already by the license
We would like to thank Nicolas Rodriguez for helping integrating SBFC library into CySBGN and Francesco Iorio for drawing the CySBGN logo. We acknowledge the financial aid received from the EU through project “BioPreDyn” (ECFP7-KBBE-2011-5 Grant number 289434), and from the LLP/Erasmus Programme.
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.