Figure 5 | BMC Bioinformatics

Figure 5

From: A linear classifier based on entity recognition tools and a statistical approach to method extraction in the protein-protein interaction literature

Decision surfaces of the VTT1 (top) and VTT5 (bottom) classifiers with SP (left) and bigram (right) textual features, for the documents in one of the four cross-validation subsamples of the training data. The decision surfaces are plotted with the parameters in Table 3, and x(d) and y(d) are computed according to Eq. (7) for every document d. In the VTT1 plots, many documents d share the same value of y(d) and therefore line up in horizontal rows, whereas the VTT5 plots show a smoother ranking of documents. This happens because VTT1 uses information from a single NER tool (ABNER protein mentions), while VTT5 uses information from five such tools. In the VTT1 plots many documents have the same number of ABNER protein mentions, whereas in the VTT5 plots the combined NER measurements distinguish documents more finely.
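
The tie effect described in the caption can be illustrated with a minimal sketch. It does not implement Eq. (7) or use the parameters in Table 3: the mention counts, the weights, and the function names `single_tool_score` and `multi_tool_score` are all hypothetical, chosen only to show why a score built from one integer count produces many ties while a score combining five counts rarely does.

```python
# Hypothetical toy data: per-document mention counts from five NER tools.
# These numbers are invented for illustration, not taken from the paper.
docs = [
    (3, 1, 0, 2, 5),
    (3, 0, 4, 1, 2),
    (3, 2, 2, 0, 0),
    (1, 5, 1, 3, 1),
    (1, 0, 0, 0, 1),
    (1, 3, 2, 2, 4),
]


def single_tool_score(counts):
    # Stand-in for a VTT1-like y(d): only the first tool's count is used.
    return counts[0]


def multi_tool_score(counts, weights=(1.0, 0.5, 0.25, 0.125, 0.0625)):
    # Stand-in for a VTT5-like y(d): a weighted sum over all five tools.
    # The weights are assumptions, not the fitted parameters of the paper.
    return sum(w * c for w, c in zip(weights, counts))


single = [single_tool_score(d) for d in docs]
multi = [multi_tool_score(d) for d in docs]

# Number of distinct y values; fewer distinct values means more documents
# stacked on the same horizontal row of the plot.
distinct_single = len(set(single))
distinct_multi = len(set(multi))

print("single-tool distinct values:", distinct_single)
print("five-tool distinct values:", distinct_multi)
```

In this toy set the six documents collapse onto two rows under the single-tool score but receive six different values under the five-tool score, which mirrors the qualitative difference between the VTT1 and VTT5 panels.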