Skip to main content

Table 3 Results for the FACS simHC metagenomic data set (105sequences, 269 bp)

From: A comparative evaluation of sequence classification programs

  actual CARMA MEGAN MetaPhyler MG-RAST
percentage of sequence classified   29.0 54.4 0.2 27.1
Eukaryota 73.0 30.3 42.0 0.0 21.0
Bacteria 25.6 62.8 52.0 84.0 71.5
Viruses 1.5 0.0 0.3 0.0 0.1
Archaea 0.0 6.9 5.7 16.0 7.3
percentage of sequence misclassified   8.0 12.2 16.0 7.6
correlation coefficient   0.45 0.72 -0.09 0.26
  1. The actual distribution of sequences compared to the distribution inferred by the alignment-based programs.