Table 3 Comparison of the results for the Pfam domain families in data set #4 with the output of coreClust and comparison of these coreClust clusters with their matching families based on pClust

From: Alignment-free clustering of large data sets of unannotated protein conserved regions using minhashing

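| Pfam Family | \(\lvert Pfam \rvert\) | \(\lvert coreCl \rvert\) | \(\frac{\lvert Pfam \cap coreCl \rvert}{\lvert Pfam \rvert}\) | \(\frac{\lvert Pfam \cap coreCl \rvert}{\lvert coreCl \rvert}\) | \(\lvert pClust \rvert\) | \(\frac{\lvert pClust \cap coreCl \rvert}{\lvert pClust \rvert}\) | \(\frac{\lvert pClust \cap coreCl \rvert}{\lvert coreCl \rvert}\) |
|---|---|---|---|---|---|---|---|
| PF03880.12 | 1232 | 84 | 0.07 | 1 | 464 | 0.18 | |
| PF00271.28 | 1192 | 364 | 0.30 | 1 | 1083 | 0.33 | 0.99 |
| PF00270.26 | 1187 | 260 | 0.22 | 1 | 1077 | 0.24 | 1 |
| PF08298.8 | 424 | 200 | 0.47 | 1 | 359 | 0.55 | 0.98 |
| PF06798.9 | 410 | 172 | 0.42 | 1 | 199 | 0.86 | 1 |
| PF04245.10 | 384 | 47 | 0.12 | 1 | 93 | 0.50 | 1 |
| PF12343.5 | 24 | 15 | 0.62 | 1 | 24 | 0.62 | 1 |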
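
The two ratio columns reported for each comparison are overlap fractions between a reference group (a Pfam family or a pClust cluster) and the matching coreClust cluster: the shared members divided by the size of the reference group, and the shared members divided by the size of the coreClust cluster. A minimal sketch of that calculation follows. It assumes each group is available as a set of sequence identifiers; the function name `overlap_fractions` and the toy identifiers are hypothetical and are not taken from the paper or from any of the tools.

```python
def overlap_fractions(reference, cluster):
    """Return (|reference ∩ cluster| / |reference|, |reference ∩ cluster| / |cluster|).

    `reference` and `cluster` are sets of sequence identifiers (an assumption;
    the paper does not specify a data structure).
    """
    if not reference or not cluster:
        raise ValueError("both groups must be non-empty")
    shared = len(reference & cluster)  # members present in both groups
    return shared / len(reference), shared / len(cluster)


# Toy data: a 24-member reference family and a 15-member cluster
# that lies entirely inside it.
family = {f"seq{i}" for i in range(24)}
cluster = {f"seq{i}" for i in range(15)}

coverage, purity = overlap_fractions(family, cluster)
print(coverage, purity)
```

In this toy case the first fraction is 15/24 and the second is 1, the same pattern as the rows where the coreClust cluster falls entirely within the reference group but covers only part of it.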