Skip to main content
Figure 4 | BMC Bioinformatics

Figure 4

From: Binning sequences using very sparse labels within a metagenome

Figure 4

Identification of an appropriate Clustering Percentage (CP). Five datasets for each of 5, 10 and 20 species are randomly sampled. The averages of S-GSOM's clustering performance for the datasets are plotted against Clustering Percentage (CP) values. A trend of decreasing in clustering performance with increasing CP can be noted. A compromised value of CP = 55% is marked where both the number of assigned nodes and clustering performance are high.

Back to article page