Table 1 CTD manual curation metrics

From: Text mining and manual curation of chemical-gene-disease networks for the Comparative Toxicogenomics Database (CTD)

| Data | Biocurator 1 | Biocurator 2 | Biocurator 3 | Average |
|---|---|---|---|---|
| Total no. articles examined | 112 | 112 | 112 | 112 |
| No. articles curated (%) | 57 (51) | 74 (66) | 69 (62) | 67 (60) |
| No. articles rejected (%) | 55 (49) | 38 (34) | 43 (38) | 45 (40) |
| Time spent reviewing articles (a) | 1331 | 893 | 2263 | 1496 |
| Time spent on curatable articles (%) | 1198 (90) | 822 (92) | 2133 (94) | 1384 (93) |
| Time spent on rejected articles (%) | 133 (10) | 71 (8) | 130 (6) | 111 (7) |
| Curation rate (± SD) (b) | 21.0 (31.1) | 11.1 (13.1) | 30.9 (52.9) | 20.7 |
| Rejection rate (± SD) (c) | 2.4 (3.4) | 1.9 (3.1) | 3.0 (4.4) | 2.5 |
| Total data extracted (d) | 828 | 2330 | 3039 | 2066 |
| Data per curated article (± SD) | 14.5 (34.4) | 31.5 (143.7) | 44.0 (209.8) | 30.8 |
| Data extraction rate (± SD) | 0.5 (0.3) | 1.4 (1.7) | 0.6 (0.6) | 0.8 |

  1. (a) All times and rates were recorded or calculated in minutes.
  2. (b) Curation rate = Time spent per curated article. SD = standard deviation.
  3. (c) Rejection rate = Time spent per rejected article.
  4. Total data extracted = total number of chemical-gene, chemical-disease, and gene-disease interactions.
  5. (d) Data extraction rate = macro-average of individual rates of the number of interactions for each curatable article.
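
The rate definitions in the footnotes can be checked against the table with a few lines of arithmetic. The following is a minimal Python sketch, not the authors' code: the dictionary keys and function names are illustrative, the per-curator inputs are copied from Table 1, and the per-article figures used to contrast macro- and micro-averaging are hypothetical, since the table does not report per-article data. It also assumes that an individual data extraction rate means interactions per minute for one article, which the footnotes suggest but do not state outright.

```python
from statistics import mean

# Values copied from Table 1 (all times in minutes). Key names are illustrative.
curators = {
    "Biocurator 1": {"curated": 57, "rejected": 55,
                     "t_curated": 1198, "t_rejected": 133, "data": 828},
    "Biocurator 2": {"curated": 74, "rejected": 38,
                     "t_curated": 822, "t_rejected": 71, "data": 2330},
    "Biocurator 3": {"curated": 69, "rejected": 43,
                     "t_curated": 2133, "t_rejected": 130, "data": 3039},
}


def per_curator_rates(row):
    """Return (curation rate, rejection rate, data per curated article)."""
    return (
        round(row["t_curated"] / row["curated"], 1),    # minutes per curated article
        round(row["t_rejected"] / row["rejected"], 1),  # minutes per rejected article
        round(row["data"] / row["curated"], 1),         # interactions per curated article
    )


def macro_average(rates):
    """Mean of per-article rates: every article counts equally."""
    return mean(rates)


def micro_average(counts, minutes):
    """Pooled rate: total interactions divided by total minutes."""
    return sum(counts) / sum(minutes)


rates = {name: per_curator_rates(row) for name, row in curators.items()}
for name, triple in rates.items():
    print(name, triple)

# HYPOTHETICAL per-article figures (not from the paper): two articles with
# 2 and 40 interactions, curated in 10 and 20 minutes respectively.
hyp_counts = [2, 40]
hyp_minutes = [10, 20]
hyp_macro = macro_average([c / m for c, m in zip(hyp_counts, hyp_minutes)])
hyp_micro = micro_average(hyp_counts, hyp_minutes)
print("macro:", hyp_macro, "micro:", hyp_micro)
```

The first three derived values reproduce the corresponding table rows for each biocurator. The hypothetical pair shows why a macro-averaged rate can differ from the pooled ratio of totals: a single data-rich article pulls the pooled figure more strongly than it pulls the per-article mean.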