Figure 1
From: A grammar-based distance metric enables fast and accurate clustering of large sets of 16S sequences

Algorithm overview. The algorithm operates on each sequence, s_i, which is parsed into a suffix tree, t_i, and a dictionary, d_i, for rapid distance comparison with other sequences. Each sequence is either added to an existing cluster, c_j ∈ C, or becomes the initial representative sequence in a new cluster, c_k.
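
The assignment loop described in the caption can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the suffix tree t_i is omitted, the dictionary d_i is built with a plain LZ78-style parse, the distance (one minus the Jaccard similarity of two dictionaries) is a stand-in for the paper's grammar-based metric, and the names `lz78_dictionary`, `dictionary_distance`, `greedy_cluster`, and the `threshold` parameter are all hypothetical.

```python
from dataclasses import dataclass, field


def lz78_dictionary(seq):
    """Build d_i: the set of phrases from an LZ78-style parse (assumed scheme)."""
    phrases = set()
    current = ""
    for ch in seq:
        current += ch
        if current not in phrases:
            phrases.add(current)  # new phrase: record it and start over
            current = ""
    return frozenset(phrases)


def dictionary_distance(d_a, d_b):
    """Stand-in distance: 1 - Jaccard similarity (NOT the paper's metric)."""
    union = d_a | d_b
    if not union:
        return 0.0
    return 1.0 - len(d_a & d_b) / len(union)


@dataclass
class Cluster:
    representative: str        # first sequence placed in the cluster
    rep_dictionary: frozenset  # its dictionary, reused for every comparison
    members: list = field(default_factory=list)


def greedy_cluster(sequences, threshold):
    """Add each s_i to the nearest cluster c_j within threshold, else open c_k."""
    clusters = []
    for s_i in sequences:
        d_i = lz78_dictionary(s_i)
        best, best_dist = None, None
        for c_j in clusters:
            dist = dictionary_distance(d_i, c_j.rep_dictionary)
            if dist <= threshold and (best is None or dist < best_dist):
                best, best_dist = c_j, dist
        if best is None:
            # No existing cluster is close enough: s_i represents a new cluster.
            clusters.append(Cluster(s_i, d_i, [s_i]))
        else:
            best.members.append(s_i)
    return clusters
```

For example, `greedy_cluster(["ACGTACGTAC", "ACGTACGTAC", "TTTTGGGGCC"], threshold=0.3)` places the two identical sequences in one cluster and opens a second cluster for the third. Comparing each new sequence only against cluster representatives keeps the number of distance computations proportional to the number of clusters rather than the number of sequences seen so far.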