Skip to main content
Figure 2 | BMC Bioinformatics

Figure 2

From: A grammar-based distance metric enables fast and accurate clustering of large sets of 16S sequences

Figure 2

Dictionary creation steps. Determining the order of the LZ dictionary, |d i |, for sequence s i . (a) The initial step in which the initial fragment, f1, is set to the first letter, s i (1), of the sequence. (b) The start of the k th step in which the k th letter, s i (k), is appended to the current fragment, fk. After the first k - 1 letters of s i are scanned for the occurrence of the fragment, f k , the two possible outcomes are (c) the fragment is reproducible with combinations of existing rules, or (d) the fragment is unique up to this point in the sequence, and so a new grammar rule is added to the dictionary and the fragment is reset.

Back to article page