Skip to main content

Table 3 Corpus statistics

From: Comparative analysis of five protein-protein interaction corpora

Corpus Per sentence average number of Fraction of sentences with
Tokens Entities Entity pairs Interactions No entities No interactions  
AIMed 25.2 2.2 3.0 0.5 18% 69%
BioInfer 31.3 4.2 9.4 1.3 ~0% 48%
HPRD50 26.1 2.8 3.0 1.1 0% 38%
IEPA 32.2 2.3 1.7 0.7 0% 37%
LLL 29.6 3.1 4.3 2.1 0% 0%