Table 2 Statistics for the datasets used in the experiments

From: Identification of transcription factor contexts in literature using machine learning approaches

Data set                  TF data (positive data)  PPI data (noisy negative data)               NonPF data (negative data)
Resource                  FlyTF   MeSH   GO        LLL   BioCreAtIvE   PICorpus   GeneRIF HIV   Prodisen
# sentences per resource  491     712    477       77    283           127        1200          1700
Total # sentences         1680                     1687                                         1700