Skip to main content

Table 2 Details of the three BioNLP 2013 ST datasets

From: Comparing neural models for nested and overlapping biomedical event detection

Dataset

Item

Train

Dev.

Test

CG 2013

Documents

300

100

200

 

Sentences

2640

850

1610

 

Pct unknown words

0%

10.63%

10.68%

 

 Events

9422

3217

5530

 

 Flat events

45.31%

44.07%

NA

 

 Nested events

34.95%

36.46%

NA

 

 Overlapping events

41.05%

43.05%

NA

 

 Inter-sentence events

4.08%

3.11%

NA

PC 2013

Documents

260

90

175

 

Sentences

1900

660

1254

 

 Pct unknown words

0%

11.66%

11.50%

 

Events

6,657

2320

4004

 

 Flat events

33.28%

34.74%

NA

 

 Nested events

38.90%

38.87%

NA

 

 Overlapping events

54.88%

52.80%

NA

 

 Inter-sentence events

4.70%

2.41%

NA

GE 2013

Documents

10

10

14

 

Sentences

1051

1104

1188

 

 Pct unknown words

0 %

16.84%

17.07%

 

Events

2882

3259

3301

 

 Flat events

53.57%

42.25%

NA

 

 Nested events

31.15%

38.96%

NA

 

 Overlapping events

26.57%

35.80%

NA

 

 Inter-sentence events

10.55%

22.55%

NA