From: TBGA: a large-scale Gene-Disease Association dataset for Biomedical Relation Extraction
Dataset | Split | Instances | Bags | Inst.s/bag | Relations |
---|---|---|---|---|---|
BioRel | Train | 534,277 | 39,969 | 13.37 | 125 |
Validation | 114,506 | 20,675 | 5.54 | ||
Test | 114,565 | 20,756 | 5.52 | ||
DTI | Train | 604,303 | 472,033 | 1.28 | 6 |
Validation | 6133 | 4769 | 1.29 | ||
Test | 6312 | 4817 | 1.31 | ||
TBGA | Train | 178,264 | 85,047 | 2.10 | 4 |
Validation | 20,193 | 10,491 | 1.92 | ||
Test | 20,516 | 10,494 | 1.96 |