Skip to main content

Table 2 Statistics for the datasets used in the experiments

From: Identification of transcription factor contexts in literature using machine learning approaches

 

TF data (positive data)

PPI data (noisy negative data)

NonPF data (negative data)

 

FlyTF

MeSH

GO

LLL

BioCreAtIvE

PICorpus

GeneRIF HIV

Prodisen

# sentences per resource

491

712

477

77

283

127

1200

1700

total # sentences

1680

1687

1700