Skip to main content
Figure 6 | BMC Bioinformatics

Figure 6

From: Denoising PCR-amplified metagenome data

Figure 6

Error probability symmetries for Divergent (a) and (d), Artificial (b) and (e), and Titanium (c) and (f) data sets. (a)-(c): context-independent substitution error probabilities inferred by DADA with 95% confidence intervals based on binomial sampling error. Note the approximate symmetry between ij and ī j ̄ probabilities (which show up contiguously along the y-axis), where ī denotes the complement of nucleotide i. (d)-(f): All 96 reverse-complementary pairs of context-dependent error probabilities inferred by DADA for each data set. For each pair, the probability of the error away from an A or C is plotted on the x-axis and the error probability away from T or G is plotted on the y-axis. The pairing between these probabilities – seen by the tendency to lie along the diagonal – is stronger for the largest probabilities, which have the least sampling noise. The colors signify complementary pairs of errors red = (AG,TC) cyan=(CT,GA) green=(AT,TA) black=(CA,GT) blue=(AC,TG) purple=(CG,GC).

Back to article page