Table 5 Sizes and gold label statistics of the splits for the Bacteria_Corpus

From: Extracting chemical reactions from text using Snorkel

| Split | Abstracts | Candidates | Positives | Docs with candidates | Docs with positives |
|---|---|---|---|---|---|
| Bacteria_Train | 872,591 | 8,928,937 | – | 417,404 | – |
| Bacteria_Test | 200 | 2398 | 43 | 96 | 13 |
| Bacteria_Dev | 223 | 2806 | 69 | 110 | 22 |
| MetaCyc_Test | 23 | 1212 | 49 | 23 | 15 |