From: Comparative analysis of five protein-protein interaction corpora
Corpus | Per sentence average number of | Fraction of sentences with | ||||
---|---|---|---|---|---|---|
Tokens | Entities | Entity pairs | Interactions | No entities | No interactions | |
AIMed | 25.2 | 2.2 | 3.0 | 0.5 | 18% | 69% |
BioInfer | 31.3 | 4.2 | 9.4 | 1.3 | ~0% | 48% |
HPRD50 | 26.1 | 2.8 | 3.0 | 1.1 | 0% | 38% |
IEPA | 32.2 | 2.3 | 1.7 | 0.7 | 0% | 37% |
LLL | 29.6 | 3.1 | 4.3 | 2.1 | 0% | 0% |