Skip to main content

Table 10 Effect of dictionary matching. The results are for a 10-fold cross validation. The results in Table 5 were for the evaluation test data; cross validation was not used. "GPD1" means the results using GPD1 in dictionary matching. They correspond to the 1st run in Table 5. "GPD1 with regexp." means the results using GPD1 with regular expressions in dictionary matching. They correspond to the 2nd run in Table 5. "GPD2" means the results using GPD2 in dictionary matching. They correspond to the 3rd run in Table 5. "GDP1-stop words" means the results when the stop words were not ignored in dictionary matching on the 1st run. The parenthesized values are p-values. The values in bold have a statistically significant difference from the 1st value. A difference is labeled statistically significant when the p-value is less than 0.05 on the Wilcoxon signed-ranks sum test (two-sided).

From: Gene/protein name recognition based on support vector machine using dictionary as features

 

GPD1

GPD1 with regexp.

GPD2

GPD1-stop words

Precision

0.8189

0.8199 (0.695)

0.8130 (0.131)

0.8099 (0.006)

Recall

0.7661

0.7668 (1.000)

0.7596 (0.160)

0.7600 (0.049)

Balanced f-score

0.7916

0.7924 (0.770)

0.7854 (0.105)

0.7841 (0.006)