Fig. 1
From: Removing duplicate reads using graphics processing units

Clustering. Reads with an identical prefix of k nucleotides are considered potential duplicate reads. Image from [16] used under the terms of the Creative Commons Attribution License (CC BY)