Skip to main content

Table 1 Some entries in the context matrix. Scores are derived from the ratio between frequency of the given word in the given position with respect to the gene (position +1 means immediately after the gene name, position +2 means two words behind, etc.) and the total frequency of the word.

From: Text Detective: a rule-based system for gene annotation in biomedical texts

WORD

POSITION

 

-3

-2

-1

+1

+2

+3

gene

0

0.5

5.0

5.0

0.5

0

function

0

1.8

0

2.1

0

0

cell

0

0

-2.5

-5.0

0

0