Table 7 Three selected examples of rules generated by HIall and HIseq. The columns are: # rules, the total number of rules found; # pc, the number of positive examples covered in the training data; # pnc, the number of positive examples not covered in the training data; % CovP, the percentage of positive examples in the training data that are covered; % CovN, the percentage of negative examples in the training data that are covered; # uc, the number of uncertain examples covered; and # unc, the number of uncertain examples not covered.

From: Homology Induction: the use of machine learning to improve sequence similarity searches

HIall
PDB # rules # pc # pnc % CovP % CovN # uc # unc
1CPC 2 120 0 100.00 1.00 1 2
1MPP 1 91 1 98.91 0.00 2 13
1MLA 1 17 5 77.27 0.00 4 13
HIseq
PDB # rules # pc # pnc % CovP % CovN # uc # unc
1CPC 2 89 31 74.17 1.20 1 2
1MPP 3 62 30 67.39 1.20 3 12
1MLA 1 16 6 72.73 0.20 5 12
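
The % CovP column appears to follow directly from # pc and # pnc. The sketch below assumes % CovP = 100 × # pc / (# pc + # pnc), which is an inference from the caption and the tabulated values rather than a formula stated in the source. The function name `cov_p` and the `rows` dictionary are hypothetical names introduced for illustration.

```python
def cov_p(pc: int, pnc: int) -> float:
    """Percentage of positive training examples covered.

    Assumed formula (not stated in the source): 100 * pc / (pc + pnc),
    rounded to two decimal places as in the table.
    """
    return round(100.0 * pc / (pc + pnc), 2)


# (# pc, # pnc) pairs copied from the table, keyed by (method, PDB entry).
rows = {
    ("HIall", "1CPC"): (120, 0),
    ("HIall", "1MPP"): (91, 1),
    ("HIall", "1MLA"): (17, 5),
    ("HIseq", "1CPC"): (89, 31),
    ("HIseq", "1MPP"): (62, 30),
    ("HIseq", "1MLA"): (16, 6),
}

# Recompute % CovP for every row.
computed = {key: cov_p(pc, pnc) for key, (pc, pnc) in rows.items()}

for (method, pdb), value in computed.items():
    print(f"{method} {pdb}: {value:.2f}")
```

Under this assumption the recomputed values agree with the % CovP column for all six rows. % CovN cannot be checked the same way because the table does not list the negative example counts.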