- Open Access
SILVA tree viewer: interactive web browsing of the SILVA phylogenetic guide trees
BMC Bioinformaticsvolume 18, Article number: 433 (2017)
Phylogenetic trees are an important tool to study the evolutionary relationships among organisms. The huge amount of available taxa poses difficulties in their interactive visualization. This hampers the interaction with the users to provide feedback for the further improvement of the taxonomic framework.
The SILVA Tree Viewer is a web application designed for visualizing large phylogenetic trees without requiring the download of any software tool or data files. The SILVA Tree Viewer is based on Web Geographic Information Systems (Web-GIS) technology with a PostgreSQL backend. It enables zoom and pan functionalities similar to Google Maps. The SILVA Tree Viewer enables access to two phylogenetic (guide) trees provided by the SILVA database: the SSU Ref NR99 inferred from high-quality, full-length small subunit sequences, clustered at 99% sequence identity and the LSU Ref inferred from high-quality, full-length large subunit sequences.
The Tree Viewer provides tree navigation, search and browse tools as well as an interactive feedback system to collect any kinds of requests ranging from taxonomy to data curation and improving the tool itself.
Reconstructing phylogenetic trees is an important method for studying the evolutionary relations among organisms. In molecular phylogeny, genetic data are the basis for any kind of phylogenetic inferences. The ever growing amount of genetic data calls for approaches to dynamically visualize phylogenetic trees that comprise hundreds of thousands of sequences.
The SILVA project provides comprehensive, high-quality datasets of small (16S/18S, SSU) and large (23S/28S, LSU) subunit ribosomal RNA (rRNA) gene sequences for all three domains of life (Bacteria, Archaea, and Eukaryota) . These datasets include large phylogenetic guide trees which need to be interactively visualised for biologist seeking relationships and evolutionary information. For the current release (r128) of the SILVA dataset, the following trees are available:
SSU Ref NR99 inferred from 645,151 high-quality, full-length SSU rRNA gene sequences, clustered at 99% sequence identity.
LSU Ref inferred from 154,297 high-quality, full-length LSU rRNA gene sequences.
The SILVA trees include a large amount of metadata about each sequence, which requires several gigabytes of memory footprint, even in compressed form. Currently, browsing such large trees is only possible using desktop applications such as ARB , which is only available for Mac OS and Linux. Web-based solutions, like phylo.io  or iTOL , are not able to effectively handle trees of this dimension. The Open Tree of Life  relies on folding and allows the navigation of only one depth level at a time. OneZoom  supports large trees but relies on fractals to keep a manageable tree visualization and, hence, does not preserve meaningful branch lengths. In need of an effective, platform independent phylogenetic tree navigation tool, we developed the SILVA Tree Viewer as a web application.
The SILVA Tree Viewer has been developed to provide access to large scale (e.g. 38,585 taxa with 645,151 sequences for the SSU tree) trees in a web browser. The SILVA website hosts the viewer (version 1.1) to provide access to the SSU RefNr99 and LSU Ref phylogenetic guide trees of the SILVA database.
Results and discussion
The phylogenetic tree is presented to the users with pan and zoom functions similar to those in web mapping technology. A search function, as well as browsing of phyla and major clades, is provided to allow users to quickly lookup sequences and taxa. Additionally, the viewer is equipped with an interactive feedback system which allows reporting of taxonomic problems and asking questions to the tree curators, thus supporting them in improving the quality of the trees. The core components of the SILVA Tree Viewer’s main interface are shown in Fig. 1.
The tree can be navigated and zoomed in analogy to Google Maps. The different zoom levels allow rapid drill-down navigation from the overview to any area of interest. To further facilitate navigation in large trees, the Navigation bar indicates the current view in relation to the whole tree. The taxonomic groups are shown as coloured rectangles whose colour depend on the taxonomic rank of the group (Fig. 2), enclosing all sequences belonging to them. Further details of any sequence, provided by the SILVA database, can be shown by clicking on its label.
Taxonomic context information is essential for the understanding of the tree. The viewer provides taxonomic context about the currently displayed tree portion in two ways. The Taxon bar, at the bottom of the screen, shows the full path of the taxon under the mouse pointer, along with sequence counts. The Taxonomy Brackets, shown to the right of the visible tree, are vertical square brackets according to the vertical extension of taxonomic groups “crossing” the current view of the tree. They are shown from right to left according to their rank (domain being the rightmost). In case the brackets are too short for the corresponding label to be shown, no label is shown and the taxonomic information remains available via tooltip.
Search and browse tools
A search function is provided which allows searching for distinct elements of the tree. The search result is displayed in the viewer (Fig. 3).
Search for sequences
The user can search for sequences by accession number, returning any sequence whose accession number starts with the query. In this case, the search function can handle start and stop positions, separated by full stops (i.e. the complete accession is composed of <ACCESSION NUMBER > . < START>. < STOP>). Furthermore, a search by full name can be performed, returning all sequences whose species name contains the query string.
Search for taxa
The user can search for taxonomic groups by taxon name, returning all taxa whose name contains the query. The search by taxonomic path returns all taxa having the query as part of their full taxonomic path is also supported. In this case, since all taxa within a lower rank will match the search condition, only the highest (more generic) rank result will be returned. This search marks taxonomic groups as rectangular features which are loaded to the results navigator for browsing by the user.
All taxonomic groups which are direct children of any domain (phyla for Bacteria and Archaea, kingdom/major clades for Eukaryota) are listed for direct browsing. A click on the taxon name loads all taxonomic groups comprising it, zooms the view to the first result and allows navigation through results as for the other search types.
The feedback system is an important part of the SILVA Tree viewer since the SILVA guide trees are manually curated. The feedback system includes the possibility to report a so-called “data problem”: wrong classifications, errors in the taxonomy and any other problem with the phylogenetic data. This eases the curation process, thus supporting the quality improvement of the dataset in a cooperative effort with the users. Reporting a data problem is a two-step process: first, the data affected has to be selected by picking it interactively on the SILVA Tree Viewer, second, a report form has to be filled to provide details about the problem.
The feedback system can also be used to report on technical problems and to propose improvements to the viewer, as well as for asking general questions to the SILVA team.
Known limitations and future developments
In its current version (1.1), the SILVA Tree Viewer implements only visualization tools, thus no editing of the trees is supported. Due to technical constraints of the Web-GIS technology there is also no support for group folding, thus the tree is always shown as fully expanded.
Integration with the SILVA search and download system is planned to allow extended search functionality and the direct download of sequence data. The integrated feedback system will provide further guidance for the SILVA team in providing new features, taking into account user suggestions and recommendations.
The SILVA Tree Viewer enables users worldwide to navigate the large phylogenetic trees provided by the SILVA database. Hundreds of thousands of sequences can be searched in the tree and browsed in their taxonomic context. An interactive feedback system is provided to collect requests for improving the tool itself and guide data curation.
Availability and requirements
Project name: SILVA Tree Viewer.
Project home page: https://www.arb-silva.de/treeviewer
Operating system(s): Platform independent.
Other requirements: PostgreSQL including PostGIS, Apache.
Any restrictions to use by non-academics: none.
Quast C, Pruesse E, Yilmaz P, Gerken J, Schweer T, Yarza P, Peplies J, Glockner FO. The SILVA ribosomal RNA gene database project: improved data processing and web-based tools. Nucleic Acids Res. 2013;41(Database issue):D590–6.
Ludwig W, Strunk O, Westram R, Richter L, Meier H, Yadhukumar BA, Lai T, Steppi S, Jobb G, et al. ARB: a software environment for sequence data. Nucleic Acids Res. 2004;32(4):1363–71.
Robinson O, Dylus D, Dessimoz C. Phylo.io: Interactive Viewing and Comparison of Large Phylogenetic Trees on the Web. Mol Biol Evol. 2016;33(8):2163–6.
Letunic I, Bork P. Interactive tree of life (iTOL) v3: an online tool for the display and annotation of phylogenetic and other trees. Nucleic Acids Res. 2016;44(W1):W242–5.
Hinchliff CE, Smith SA, Allman JF, Burleigh JG, Chaudhary R, Coghill LM, Crandall KA, Deng J, Drew BT, Gazis R, et al. Synthesis of phylogeny and taxonomy into a comprehensive tree of life. P Natl Acad Sci USA. 2015;112(41):12764–9.
Rosindell J, Harmon LJ. OneZoom: A Fractal Explorer for the Tree of Life. PLoS Biol. 2012:10(10).
Postgis: a spatial database extender for PostgreSQL [http://www.postgis.net].
PostgreSQL: a powerful, open source object-relational database system. [https://www.postgresql.org].
Huerta-Cepas J, Dopazo J, Gabaldon T. ETE: a python Environment for Tree Exploration. BMC bioinformatics. 2010;11:24.
The authors would like to thank their fellow member of the SILVA team Jörg Peplies for providing the user perspective on the tool during its development; fellow members of the Max Planck Institute Hanka Medóva and David Probandt for providing early feedback and suggestions for improvement.
This work has been supported by the Federal Ministry of Education and Research (BMBF) project de.NBI (German Network for Bioinformatics Infrastructure) grant 031A539A and the Max Planck Society. None of funding bodies played any role in the design of the study and in writing the manuscript.
Availability of data and materials
The SILVA Tree Viewer, hosted by the SILVA project, is freely accessible without user registration at https://www.arb-silva.de/treeviewer.
Ethics approval and consent to participate
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.