Skip to main content

Table 6 Evaluation of BioLexicon Coverage on JNLPBA-2004 dataset

From: The BioLexicon: a large-scale terminological resource for biomedical text mining

 

BLTagger

BT-Tagger

Full

55.54

47.96

Left

56.72

55.72

Right

59.24

55.63

  1. The table compares the performance of two dictionary-based POS taggers, the BLTagger and the BT-Tagger, in recognising protein names in the JNLPBA-2004 test dataset. For each tagger, F-scores are reported in terms of full matches (i.e., the sequence of words identified as a biomedical noun by the tagger matches exactly the text span annotated as a protein in the JNLPBA-2004 training dataset) as well as right and left boundary matches (i.e., the sequence of words identified as a biomedical noun by the tagger matches either the left or right boundary of the corresponding annotation in the JNLPBA-2004 training data).