Table 1 Accuracy of the different models using different sets of parameters

From: A tree-based conservation scoring method for short linear motifs in multiple alignments of protein sequences

| Model | FPR | FNR | D lim | P lim |
|---|---|---|---|---|
| EXC DISC 1 | 0.22 | 0.14 | 0.30 | 1.00 |
| EXC DISC 2 | 0.12 | 0.20 | 0.50 | 1.00 |
| EXC CONT 1 | 0.19 | 0.17 | 0.50 | 0.80 |
| EXC CONT 2 | 0.26 | 0.10 | 0.30 | 0.80 |
| MIS DISC | 0.39 | 0.16 | 0.30 | 1.00 |

  1. The parameters P lim and D lim determine the number of informative sequences considered when calculating the conservation score of each instance (see Conservation Score section for further explanation). In all cases, the false positive and false negative rates are calculated at the optimal threshold (0.58), which maximises both the model's sensitivity and specificity. $\mathrm{FPR} = 1 - \text{specificity} = \frac{FP}{FP + TN}$ and $\mathrm{FNR} = 1 - \text{sensitivity} = \frac{FN}{FN + TP}$, where TP, TN, FP and FN are the numbers of true positives, true negatives, false positives and false negatives.
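
The two rates defined in the footnote can be illustrated with a minimal Python sketch. It is not the authors' code: the function name `error_rates`, the toy scores and labels, and the convention that an instance is called positive when its conservation score is at or above the threshold are all assumptions made for illustration. Only the threshold value 0.58 and the FPR and FNR definitions come from the table footnote.

```python
from typing import Sequence, Tuple


def error_rates(
    scores: Sequence[float],
    labels: Sequence[bool],
    threshold: float,
) -> Tuple[float, float]:
    """Return (FPR, FNR) for a given score threshold.

    Assumption: an instance is predicted positive when score >= threshold.
    """
    tp = fp = tn = fn = 0
    for score, is_true_motif in zip(scores, labels):
        predicted_positive = score >= threshold
        if predicted_positive and is_true_motif:
            tp += 1
        elif predicted_positive and not is_true_motif:
            fp += 1
        elif not predicted_positive and is_true_motif:
            fn += 1
        else:
            tn += 1
    # FPR = FP / (FP + TN) = 1 - specificity
    fpr = fp / (fp + tn) if (fp + tn) else 0.0
    # FNR = FN / (FN + TP) = 1 - sensitivity
    fnr = fn / (fn + tp) if (fn + tp) else 0.0
    return fpr, fnr


# Hypothetical toy data: four true instances followed by four false ones.
toy_scores = [0.9, 0.7, 0.6, 0.4, 0.65, 0.3, 0.2, 0.1]
toy_labels = [True, True, True, True, False, False, False, False]

fpr, fnr = error_rates(toy_scores, toy_labels, threshold=0.58)
print(f"FPR = {fpr:.2f}, FNR = {fnr:.2f}")  # FPR = 0.25, FNR = 0.25
```

With this toy data one true instance falls below 0.58 and one false instance falls above it, so both rates come out at 1/4. Running the same function over a range of thresholds is one way to look for the value that balances sensitivity against specificity.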