Table 5 Validation of Results: Assignments.

From: Flexible taxonomic assignment of ambiguous sequencing reads

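| metric | length (bp) | ∑ D_i | q = 0.0 | q = 0.1 | q = 0.2 | q = 0.3 | q = 0.4 | q = 0.5 | q = 0.6 | q = 0.7 | q = 0.8 | q = 0.9 | q = 1.0 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| E(D_i) [16S] | 100 | 0.125 | 2.3% | 3.9% | 4.5% | 5.1% | 5.0% | 5.4% | 5.7% | 6.2% | 7.0% | 9.6% | 45.0% |
| | 150 | 0.080 | 3.5% | 4.5% | 4.9% | 5.4% | 4.9% | 5.2% | 5.2% | 5.2% | 5.2% | 5.4% | 50.0% |
| | 200 | 0.070 | 4.3% | 4.8% | 5.1% | 5.5% | 4.8% | 5.2% | 5.1% | 4.8% | 4.4% | 4.1% | 51.4% |
| E(D_i) [V1-V2] | 100 | 0.077 | 4.6% | 5.2% | 5.6% | 5.9% | 5.3% | 5.5% | 5.3% | 5.0% | 4.9% | 5.0% | 47.1% |
| | 150 | 0.056 | 5.7% | 6.0% | 6.1% | 6.3% | 5.5% | 5.7% | 5.5% | 4.7% | 4.1% | 3.4% | 46.5% |
| | 200 | 0.023 | 7.1% | 7.3% | 7.3% | 7.4% | 6.2% | 6.2% | 5.4% | 4.5% | 4.0% | 3.0% | 41.0% |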

  1. Percentage of reads assigned at the node selected for each value of q when maximizing E(D_i) in simulations using the full-length 16S rRNA sequence (top) and the V1-V2 hypervariable region (bottom) for reads of length 100, 150, and 200 bp. The column ∑ D_i indicates the best sum of distances achieved.
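
As a quick consistency check on the values above, the rows can be transcribed and summed in a short Python sketch. The names `TABLE5` and `row_total` are illustrative and do not come from the paper. The check assumes that each row is a distribution of reads across the eleven values of q, which the caption suggests but does not state outright.

```python
# Values transcribed from Table 5.
# Keys are (sequence region, read length in bp); each list holds the
# percentage of reads for q = 0.0, 0.1, ..., 1.0, in that order.
# TABLE5 and row_total are hypothetical names used only for this sketch.
TABLE5 = {
    ("16S", 100): [2.3, 3.9, 4.5, 5.1, 5.0, 5.4, 5.7, 6.2, 7.0, 9.6, 45.0],
    ("16S", 150): [3.5, 4.5, 4.9, 5.4, 4.9, 5.2, 5.2, 5.2, 5.2, 5.4, 50.0],
    ("16S", 200): [4.3, 4.8, 5.1, 5.5, 4.8, 5.2, 5.1, 4.8, 4.4, 4.1, 51.4],
    ("V1-V2", 100): [4.6, 5.2, 5.6, 5.9, 5.3, 5.5, 5.3, 5.0, 4.9, 5.0, 47.1],
    ("V1-V2", 150): [5.7, 6.0, 6.1, 6.3, 5.5, 5.7, 5.5, 4.7, 4.1, 3.4, 46.5],
    ("V1-V2", 200): [7.1, 7.3, 7.3, 7.4, 6.2, 6.2, 5.4, 4.5, 4.0, 3.0, 41.0],
}


def row_total(percentages):
    """Return the sum of the per-q percentages in one table row."""
    return sum(percentages)


for (region, length), row in TABLE5.items():
    # row[-1] is the last column, i.e. the share of reads at q = 1.0.
    print(
        f"{region:>5} {length:>3} bp: "
        f"total = {row_total(row):.1f}%, share at q = 1.0 = {row[-1]:.1f}%"
    )
```

Under that reading, every row totals slightly under 100%, and the q = 1.0 column alone accounts for between 41.0% and 51.4% of reads in each simulation.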