Skip to main content

Table 2 The distribution of number of rules learnt for different targets using HIall and HIseq. HIall can generally describe patterns using fewer rules. This is expected as it uses more background types of biological knowledge. Note the strange bimodal distribution for HIseq rules. The reason for this is unknown.

From: Homology Induction: the use of machine learning to improve sequence similarity searches

Rule number HIall HIseq
0 423 485
1 425 371
2 701 228
3 133 137
4 38 81
5 37 49
6 37 38
7 14 31
8 11 119
9 2 4
10 1 0