Figure 1From: Sequencing error correction without a reference genomeNumber of vertices plotted against sequence abundance. Number of vertices for each parent node (Y) plotted against abundance (X) for sequences of length 21. The theoretical curve given by the function Y = 3L [ 1 - (1 - p)X] ([14]), using p = 0.0004 is shown in grey. This function explains the general trend of the data but not the substantial variation in number of variants.Back to article page