Skip to main content

Table 2 The distribution of number of rules learnt for different targets using HIall and HIseq. HIall can generally describe patterns using fewer rules. This is expected as it uses more background types of biological knowledge. Note the strange bimodal distribution for HIseq rules. The reason for this is unknown.

From: Homology Induction: the use of machine learning to improve sequence similarity searches

Rule number

HIall

HIseq

0

423

485

1

425

371

2

701

228

3

133

137

4

38

81

5

37

49

6

37

38

7

14

31

8

11

119

9

2

4

10

1

0