From: Swiftly Computing Center Strings
data set
first (5 species)
second (43 species)
number of sequences k
20
30
40
50
4
6
8
10
dirty columns
35.7
43.9
50.4
56.5
89.0
93.8
95.3
97.0