Table 4 General corpus statistics

From: Construction of an annotated corpus to support biomedical information extraction

 

| | Complete corpus | E. coli abstracts | Human abstracts |
|---|---|---|---|
| No. of abstracts | 240 | 167 | 73 |
| No. of events | 3067 | 2394 | 673 |
| Average events per abstract | 12.78 | 14.34 | 9.22 |
| Distinct nominalised verbs annotated | 91 | 81 | 36 |
| Events centred on nominalised verbs | 1274 (42%) | 1066 (45%) | 208 (31%) |
| Distinct verbs annotated | 184 | 152 | 107 |
| Events centred on verbs | 1793 (58%) | 1328 (55%) | 465 (69%) |

  1. Separate figures are shown for the complete corpus, the E. coli part of the corpus and the human part of the corpus.
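
The averages and percentages in Table 4 follow directly from the raw counts, so they can be re-derived as a consistency check. The sketch below is only an illustration: the counts are copied from the table, while the dictionary layout, the key names and the `derived` helper are assumptions introduced here, not part of the original work.

```python
# Raw counts copied from Table 4. The structure and key names are
# illustrative assumptions, not taken from the source article.
corpus = {
    "Complete corpus": {"abstracts": 240, "events": 3067,
                        "nominalised": 1274, "verbs": 1793},
    "E. coli abstracts": {"abstracts": 167, "events": 2394,
                          "nominalised": 1066, "verbs": 1328},
    "Human abstracts": {"abstracts": 73, "events": 673,
                        "nominalised": 208, "verbs": 465},
}


def derived(row):
    """Recompute the derived figures for one column of the table."""
    return {
        # Average number of events per abstract, to two decimal places.
        "events_per_abstract": round(row["events"] / row["abstracts"], 2),
        # Share of events centred on nominalised verbs, as a whole percent.
        "pct_nominalised": round(100 * row["nominalised"] / row["events"]),
        # Share of events centred on verbs, as a whole percent.
        "pct_verbs": round(100 * row["verbs"] / row["events"]),
    }


for name, row in corpus.items():
    # Every event is centred on either a nominalised verb or a verb,
    # so the two event counts should add up to the event total.
    assert row["nominalised"] + row["verbs"] == row["events"]
    print(name, derived(row))
```

With these counts, the recomputed values match the figures reported in the table, and the two subcorpora add up to the complete corpus for both abstracts and events.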