Skip to main content

Table 1 Comparison of clustering methods based on Needleman-Wunsch alignment scores

From: Rapid analysis of metagenomic data using signature-based clustering

Wound Microbiome
46313157 sequences, average sequence length: 337
Method Time Clusters Intercluster   Intracluster
  (m)   Avg SD Avg SD
SigClust 16 5254 -68.8 125.1 219.4 107.7
Merged SigClust 18 1260 -73.9 118.0 180.9 136.9
UClust T=0.75 156 7141 -81.4 111.6 148.5 142.1
16S Genus 2586 2221 -82.4 104.6 98.6 167.4
16S Species 2586 8375 -76.9 110.8 125.6 162.5
16S Strains 2586 9354 -82.4 104.6 98.6 167.4
Oral Metagenome – Human (mgp41)[11]
1237319 sequences, average sequence length: 59
Method Time Clusters Intercluster   Intracluster
  (m)   Avg SD Avg SD
SigClust 0.2 17621 5.5 22.3 51.6 14.1
UClust T=0.75 1.7 17621 -4.4 14.4 38.8 13.3
PRJEB4688 [12]
5497442 sequences, average sequence length: 253
Method Time Clusters Intercluster   Intracluster
  (m)   Avg SD Avg SD
SigClust 1.62 6998 -94.8 126.6 250.1 77.2
UClust T=0.75 9 6998 -109.0 117.4 121.5 93.5
  1. Results are shown for the wound data, and for two previously published Illumina metagenomic datasets. We report for each method the clustering time in minutes and the number of clusters returned. The remaining columns of the table show the mean and standard deviation of the separation for the sampled intercluster and intracluster pairs