Skip to main content

Table 4 General corpus statistics

From: Construction of an annotated corpus to support biomedical information extraction

  Complete Corpus E. coli abstracts Human abstracts
No of abstracts 240 167 73
No of events 3067 2394 673
Average Events per abstract 12.78 14.34 9.22
Distinct nom. verbs annotated 91 81 36
Events centred on nominalised verbs 1274
(42%)
1066
(45%)
208
(31%)
Distinct verbs annotated 184 152 107
Events centred on verbs 1793
(58%)
1328
(55%)
465
(69%)
  1. Separate figures are shown for the complete corpus, the E. coli part of the corpus and the human part of the corpus.