Skip to main content
Fig. 3 | BMC Bioinformatics

Fig. 3

From: Alignment-free clustering of large data sets of unannotated protein conserved regions using minhashing

Fig. 3

F1 score comparison for data sets #1-#8 using different numbers of hash functions. Red, green, and blue lines represent comparisons with the output of the algorithm for h−d hash functions, pClust, and Pfam clusters, respectively. Dashed lines in each plot show the number of hashes where the termination condition was satisfied (for τ=0.9 and d=40). a data set #1, b data set #2, c data set #3, d data set #4, e data set #5, f data set #6, g data set #7, h data set #8

Back to article page