Skip to main content

Table 11 Individual biological concept category agreement statistics

From: Construction of an annotated corpus to support biomedical information extraction

E. coli

Human

Category

N

F-score

Category

N

F-score

Gene

2010

90.55%

Gene

432

89.35%

Protein

771

51.88%

Protein

419

61.58%

Promoter

644

95.34%

Transcription_Factor

301

51.83%

Repressor

436

68.35%

DNA

298

63.08%

Operon

434

85.25%

Promoter

154

92.21%

Gene_Expression

407

78.62%

Transcription_Binding_Site

140

50.00%

Regulator

349

25.21%

Transcription

118

100.00%

Activator

345

42.32%

Cells

111

95.49%

Locus

192

72.91%

Regulation

66

96.97%

Enzyme

176

89.77%

Activator

65

9.23%

  1. Separate statistics are shown for the E. coli and human parts of the corpus. Within each part, categories are ordered according to their total number of assignments, as shown in the columns headed with N. Assignments by each pair of annotators are counted separately and added to the total.