Table 3 Corpus statistics

From: Comparative analysis of five protein-protein interaction corpora

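| Corpus | Avg. tokens per sentence | Avg. entities per sentence | Avg. entity pairs per sentence | Avg. interactions per sentence | Sentences with no entities | Sentences with no interactions |
|---|---|---|---|---|---|---|
| AIMed | 25.2 | 2.2 | 3.0 | 0.5 | 18% | 69% |
| BioInfer | 31.3 | 4.2 | 9.4 | 1.3 | ~0% | 48% |
| HPRD50 | 26.1 | 2.8 | 3.0 | 1.1 | 0% | 38% |
| IEPA | 32.2 | 2.3 | 1.7 | 0.7 | 0% | 37% |
| LLL | 29.6 | 3.1 | 4.3 | 2.1 | 0% | 0% |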