Table 11 Individual biological concept category agreement statistics

From: Construction of an annotated corpus to support biomedical information extraction

| E. coli category | N | F-score | Human category | N | F-score |
|---|---|---|---|---|---|
| Gene | 2010 | 90.55% | Gene | 432 | 89.35% |
| Protein | 771 | 51.88% | Protein | 419 | 61.58% |
| Promoter | 644 | 95.34% | Transcription_Factor | 301 | 51.83% |
| Repressor | 436 | 68.35% | DNA | 298 | 63.08% |
| Operon | 434 | 85.25% | Promoter | 154 | 92.21% |
| Gene_Expression | 407 | 78.62% | Transcription_Binding_Site | 140 | 50.00% |
| Regulator | 349 | 25.21% | Transcription | 118 | 100.00% |
| Activator | 345 | 42.32% | Cells | 111 | 95.49% |
| Locus | 192 | 72.91% | Regulation | 66 | 96.97% |
| Enzyme | 176 | 89.77% | Activator | 65 | 9.23% |
  1. Separate statistics are shown for the E. coli and human parts of the corpus. Within each part, categories are ordered by their total number of assignments, shown in the columns headed N. Assignments made by each pair of annotators are counted separately and added to the total.
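
The counting described in the footnote can be illustrated with a short sketch. It is not the authors' procedure: it assumes that annotations are compared by exact span match, that the F-score between two annotators is the symmetric form 2 x matches / (assignments by A + assignments by B), and that N sums the assignments of both annotators over every pair. The function names, annotator labels, and spans are hypothetical.

```python
from itertools import combinations


def pairwise_f_score(a, b):
    """Symmetric F-score between two annotators' sets of annotations.

    Assumption: an annotation counts as a match only if both annotators
    assigned the category to exactly the same span.
    """
    total = len(a) + len(b)
    if total == 0:
        return 0.0
    return 2 * len(a & b) / total


def category_stats(annotations):
    """Return (N, F-score) for one category.

    `annotations` maps an annotator label to a set of (start, end) spans.
    Assumption: N adds up the assignments of both annotators for every
    annotator pair, so each annotator is counted once per pair.
    """
    total_n = 0
    total_matches = 0
    for x, y in combinations(sorted(annotations), 2):
        a, b = annotations[x], annotations[y]
        total_n += len(a) + len(b)
        total_matches += len(a & b)
    f_score = 2 * total_matches / total_n if total_n else 0.0
    return total_n, f_score


# Hypothetical spans for a single category, three annotators.
example = {
    "annotator_1": {(0, 4), (10, 15), (20, 26)},
    "annotator_2": {(0, 4), (10, 15)},
    "annotator_3": {(0, 4), (20, 26), (30, 33)},
}

n, f = category_stats(example)
print(f"N = {n}, F-score = {f:.2%}")
```

Under these assumptions, a category with few matching spans relative to its total assignments yields a low F-score even when N is large, which is one way to read rows such as Regulator in the table.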