Skip to main content

Table 6 The results of the k-fold cross-validation with the different areas inder ROC curve and the optimal parameters. The varations in the optimal factors are due to some factors f i being an order of magnitude higher than the rest of the factors.

From: Homology Induction: the use of machine learning to improve sequence similarity searches

k

AUROC Hiall

factor f Hiall

AUROC Hiseq

factor f Hiseq

5

0.6525 ± 0.059

9.6 × 10-5 ± 6.07 × 10-5

0.6135 ± 0.0589

6.0 × 10-2 ± 2.82 × 10-2

10

0.6728 ± 0.1085

7.8 × 10-5 ± 4.47 × 10-5

0.6391 ± 0.1022

7.5 × 10-2 ± 2.12 × 10-2

25

0.7342 ± 0.1234

6.6 × 10-5 ± 2.8 × 10-5

0.6951 ± 0.1029

7.56 × 10-2 ± 1.73 × 10-2