Figure 5
From: Centroid based clustering of high throughput sequencing reads based on n-mer counts

Assignment of reads depending on position. The fraction of reads assigned to the dominant cluster (TPR) and to clusters other than the dominant one (FPR), shown as a function of position in the Hepatitis B virus genome. The data are smoothed by averaging over a window of length 50. Different regions of the genome cluster differently, forming consistent patterns as a consequence of the changing nucleotide composition across the genome.
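
The smoothing mentioned in the caption can be illustrated with a plain moving average. This is a minimal sketch, not the authors' code: the caption does not say whether the window is centered or how the ends of the genome are handled, so the function below (the hypothetical `moving_average`) simply returns one value per window that fits entirely within the data. The input values are synthetic placeholders, not data from the paper.

```python
import random


def moving_average(values, window=50):
    """Average each run of `window` consecutive values (flat, unweighted window).

    Assumption: only windows that fit entirely inside the data are used,
    so the result has len(values) - window + 1 entries.
    """
    if not 1 <= window <= len(values):
        raise ValueError("window must be between 1 and len(values)")
    total = sum(values[:window])
    averages = [total / window]
    for i in range(window, len(values)):
        # Slide the window one position: add the new value, drop the oldest.
        total += values[i] - values[i - window]
        averages.append(total / window)
    return averages


# Synthetic placeholder: one fraction per genome position (illustrative length).
random.seed(0)
fractions = [random.random() for _ in range(3200)]
smoothed = moving_average(fractions, window=50)
```

A flat window keeps the example short; a weighted or centered window would change the shape of the curve near sharp transitions, so the choice matters when comparing against the published figure.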