Skip to main content

Table 4 Number of clusters (2 million sequences)

From: Acceleration of sequence clustering using longest common subsequence filtering

 

100 bases

150 bases

400 bases

CD-HIT

1,242,054

1,015,466

493,384

LCS-HIT

1,185,704

970,419

480,201