Skip to main content

Table 7 The distribution of the annotations and the statistics of agreement

From: The first step in the development of text mining technology for cancer risk assessment: identifying and organizing scientific evidence in risk assessment literature

 

A1

A2

Agreement

Disagreement

Carcinogenic activity

281 (0.55)

217 (0.50)

194 (0.78)

55 (0.22)

Mode of Action

158 (0.31)

172 (0.40)

129 (0.78)

36 (0.22)

Toxicokinetics

75 (0.15)

45 (0.10)

37 (0.62)

23 (0.38)

Irrelevant

0

2

0

2

Total

514

436

360 (0.76)

116 (0.24)

  1. The columns A1 and A2 correspond to the annotators 1 and 2, respectively. The values shown are the number of annotations by the annotator. The last two columns show the statistics of agreement and disagreement. Rows 2-4 show the results for the three sub-taxonomies and the last row indicates the number of irrelevant abstracts among the relevant ones.