Table 1 Estimated genome sizes and published assembly sizes for organisms used in this study

From: Decontaminating eukaryotic genome assemblies with machine learning

| Organism | Estimated size (Mb) | Assembled sequence (Mb) |
| --- | --- | --- |
| C. remanei | 131 [39, 40] | 118.36 [52] |
| C. latens | 131 | 122.22 |
| A. vaga | 244 [63] | 218.07 [35] |
| A. thaliana | 125 [64] | 135.67 [65] |
| C. elegans | 100 [66] | 103.02 [67] |
| D. melanogaster | 175 [68] | 142.57 [69] |
| T. rubripes | 390 [70] | 393.31 [50] |
| A. radiobacter | 7.27 | 7.27 [71] |
| C. albicans | 14.86 | 14.85 [72] |
| E. coli | 4.64 | 4.64 [73] |
| P. aeruginosa | 6.27 | 6.27 [49] |
| Ralstonia sp. | 5.25 | 5.25 |

  1. Empirical study organisms are listed in the upper portion of the table, simulated target organisms in the center portion, and simulated contaminants in the lower portion. There is no published estimate of genome size for C. latens, so we used the genome size of the closely related C. remanei [42] as the estimated C. latens genome size.