Skip to main content

Advertisement

Table 1 CTD manual curation metrics

From: Text mining and manual curation of chemical-gene-disease networks for the Comparative Toxicogenomics Database (CTD)

Data Biocurator 1 Biocurator 2 Biocurator 3 Average
Total no. articles examined 112 112 112 112
No. articles curated (%) 57 (51) 74 (66) 69 (62) 67 (60)
No. articles rejected (%) 55 (49) 38 (34) 43 (38) 45 (40)
Time spent reviewing articles a 1331 893 2263 1496
Time spent on curatable articles (%) 1198 (90) 822 (92) 2133 (94) 1384 (93)
Time spent on rejected articles (%) 133 (10) 71 (8) 130 (6) 111 (7)
Curation rate (+/- SD) b 21.0 (31.1) 11.1 (13.1) 30.9 (52.9) 20.7
Rejection rate (+/- SD) c 2.4 (3.4) 1.9 (3.1) 3.0 (4.4) 2.5
Total data extracted d 828 2330 3039 2066
Data per curated article (+/- SD) 14.5 (34.4) 31.5 (143.7) 44.0 (209.8) 30.8
Data extraction rate (+/- SD) 0.5 (0.3) 1.4 (1.7) 0.6 (0.6) 0.8
  1. aAll times and rates were recorded or calculated in minutes
  2. bCuration rate = Time spent per curated article. SD = standard deviation.
  3. cRejection rate = Time spent per rejected article.
  4. Total data extracted = total number of chemical-gene, chemical-disease, and gene-disease interactions.
  5. dData extraction rate = macro-average of individual rates of the number of interactions for each curatable article.