Skip to main content

Table 6 The results of the k-fold cross-validation with the different areas inder ROC curve and the optimal parameters. The varations in the optimal factors are due to some factors f i being an order of magnitude higher than the rest of the factors.

From: Homology Induction: the use of machine learning to improve sequence similarity searches

k AUROC Hiall factor f Hiall AUROC Hiseq factor f Hiseq
5 0.6525 ± 0.059 9.6 × 10-5 ± 6.07 × 10-5 0.6135 ± 0.0589 6.0 × 10-2 ± 2.82 × 10-2
10 0.6728 ± 0.1085 7.8 × 10-5 ± 4.47 × 10-5 0.6391 ± 0.1022 7.5 × 10-2 ± 2.12 × 10-2
25 0.7342 ± 0.1234 6.6 × 10-5 ± 2.8 × 10-5 0.6951 ± 0.1029 7.56 × 10-2 ± 1.73 × 10-2