Skip to main content

Table 1 Statistics for training and development portions of applied corpora

From: Wide coverage biomedical event extraction using multiple partially overlapping corpora

Corpus

Entities

Events

Sentences

Words

GE

16,315

13,560

10,761

269,861

ID

8,501

2,779

3,412

83,063

EPI

10,094

2,453

7,827

170,809

DNAm

1,964

1,034

1,305

32,510

EPTM

4,698

1,142

3,692

82,994

mTOR

1,773

1,286

520

11,960

MLEE

3,553

4,491

1,931

37,483