Skip to main content

Table 7 Dictionary lookup performance.

From: Normalizing biomedical terms by minimizing ambiguity and variability

Method

Precision

Recall

F-score

Average lookup time (microsecond)

Bigram similariy (0.97)

0.758

0.587

0.661

6.7 × 105

Bigram similariy (0.95)

0.691

0.592

0.638

6.8 × 105

Bigram similariy (0.93)

0.612

0.610

0.611

6.8 × 105

No normalization

0.809

0.502

0.619

7

Case normalization

0.782

0.582

0.666

8

Heuristic normalization [18]

0.730

0.657

0.692

8

Automatic normalization

0.767

0.633

0.694

29

This table shows the speed and accuracy of dictionary lookup tasks using the human gene/protein dictionary and gene/protein name snippets. F-score is the harmonic mean of precision and recall. The values in the parentheses are the threshold values in soft string matching.