Skip to main content

Table 5 Corpus statistics.

From: Semantic role labeling for protein transport predicates

  All Train Test
GeneRIFs 837 637 200
Words 21620 16446 5174
Unique words 3841 3249 1459
Predicates 911 693 218
Unique predicates 86 72 44
Unique predicate lemmas 34 28 25
Roles 1544 1159 385
AGENT roles 17 14 3
PATIENT roles 822 623 199
ORIGIN roles 173 128 45
DESTINATION roles 532 394 138
  1. This table shows some basic statistics for the semantic roles annotated over the GeneRIFs in the protein transport corpus.