Figure 1 | BMC Bioinformatics

Figure 1

From: Improving de novo sequence assembly using machine learning and comparative genomics for overlap correction

Overlap statistics for E. coli MG1655 reads. For both true and false overlaps, the panels plot the percent mismatch of the alignment between the two reads (a, b), the first-quartile frequency of the k-mers within the overlap (c), the median k-mer frequency (d), the third-quartile k-mer frequency (e), and the comparative overlap score (f). Results are normalized as a percentage of the total number of overlaps within each class, true or false (a, c, d, e, f), or reported by overall count (b). The total number of true overlaps with 0 mismatches is 5,209,686.
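
As a rough illustration of the per-overlap statistics named in the caption, the sketch below computes a percent mismatch and the first-quartile, median, and third-quartile k-mer frequencies for a single overlap. It is not the authors' implementation. The function names, the choice of k, counting k-mer frequency across the whole read set, treating the overlap as an ungapped equal-length alignment, and the "inclusive" quartile method are all assumptions made for the example. The comparative overlap score (f) is not sketched because the caption does not define it.

```python
from collections import Counter
from statistics import quantiles


def kmer_counts(reads, k):
    """Count every k-mer across all reads (assumed definition of k-mer frequency)."""
    counts = Counter()
    for read in reads:
        for i in range(len(read) - k + 1):
            counts[read[i:i + k]] += 1
    return counts


def percent_mismatch(seq_a, seq_b):
    """Percent of mismatching positions between two equal-length, ungapped overlap regions."""
    if len(seq_a) != len(seq_b) or not seq_a:
        raise ValueError("overlap regions must be non-empty and of equal length")
    mismatches = sum(x != y for x, y in zip(seq_a, seq_b))
    return 100.0 * mismatches / len(seq_a)


def overlap_kmer_quartiles(overlap_seq, counts, k):
    """Return (Q1, median, Q3) of the frequencies of the k-mers inside the overlap."""
    freqs = [counts[overlap_seq[i:i + k]] for i in range(len(overlap_seq) - k + 1)]
    if len(freqs) < 2:
        raise ValueError("overlap must contain at least two k-mers")
    # The quartile convention is an assumption; the caption does not specify one.
    q1, median, q3 = quantiles(freqs, n=4, method="inclusive")
    return q1, median, q3


# Hypothetical toy data: the suffix of the first read overlaps the prefix of the second.
reads = ["ACGTAC", "GTACGT"]
k = 3
counts = kmer_counts(reads, k)
overlap_a = reads[0][-4:]  # "GTAC"
overlap_b = reads[1][:4]   # "GTAC"
mismatch = percent_mismatch(overlap_a, overlap_b)
q1, median, q3 = overlap_kmer_quartiles(overlap_a, counts, k)
```

Binning these values separately for the true and the false overlaps, and dividing each bin by the size of its class, would give distributions in the normalized form the caption describes for panels a, c, d, and e.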
