Skip to main content

Table 5 Most common words describing events

From: Construction of an annotated corpus to support biomedical information extraction

Combined

E. coli

Human

Event word

Count

(%)

Event word

Count

(%)

Event word

Count

(%)

Expression

N

362

(11.83)

Expression

N

309

(12.91)

Expression

N

53

(7.88)

Encode

V

175

(5.71)

Transcription

N

139

(5.81)

Encode

V

50

(7.43)

Transcription

N

171

(5.58)

Encode

V

125

(5.22)

Express

V

36

(5.35)

Bind

V

143

(4.66)

Bind

V

110

(4.59)

Bind

V

33

(4.90)

Regulation

N

119

(3.88)

Regulation

N

102

(4.26)

Transcription

N

32

(4.75)

Activate

V

106

(3.46)

Regulate

V

87

(3.63)

Activate

V

29

(4.31)

Regulate

V

106

(3.46)

Activate

V

77

(3.22)

Interact

V

21

(3.12)

Repress

V

82

(2.67)

Repress

V

72

(3.01)

Regulate

V

19

(2.82)

Require

V

73

(2.38)

Binding

N

61

(2.55)

Require

V

19

(2.82)

Activation

N

67

(2.18)

Repression

N

60

(2.51)

Involve

V

18

(2.67)

  1. Separate lists are shown for the corpus as a whole (combined), and for the separate E. coli and human parts of the corpus. For each word, its type is given (either (V) erb or (N)ominalised verb) together with an indication of the total number annotated events centred on the word and the percentage of all events in the corpus (or corpus part) that this figure represents.