Table 2 Real datasets used for the evaluation of EC tools

From: Evaluation of the impact of Illumina error correction tools on de novo genome assembly

| Abbr. | Organism | Reference ID | Genome size | Coverage | Sequencing platform | Read length | Trimmed reads | Dataset ID | Ref. |
|---|---|---|---|---|---|---|---|---|---|
| D1 | Bifidobacterium dentium | NC_013714.1 | 2.6 Mbp | 373× | Illumina MiSeq | 251 bp | | SRR1151311 | [23] |
| D2 | Escherichia coli K-12 DH10B | NC_010473 | 4.5 Mbp | 418× | Illumina MiSeq | 150 bp | | Illumina data library | [10] |
| D3 | Escherichia coli K-12 MG1655 | NC_000913 | 4.5 Mbp | 612× | Illumina GAII | 100 bp | | ERA000206 | [10] |
| D4 | Salmonella enterica | NC_011083.1 | 4.7 Mbp | 97× | Illumina MiSeq | 239 bp | ✓ | SRR1206093 | [23] |
| D5 | Pseudomonas aeruginosa | ERR330008 | 6.1 Mbp | 169× | Illumina MiSeq | 120 bp | ✓ | ERR330008 | [10] |
| D6 | Homo sapiens Chr. 21 | HG19 | 45.2 Mbp | 29× | Illumina HiSeq | 100 bp | | Illumina data library | [10] |
| D7 | Caenorhabditis elegans | WS222 | 97.6 Mbp | 58× | Illumina HiSeq | 101 bp | | SRR543736 | [23] |
| D8 | Drosophila melanogaster | Release 5 | 116.4 Mbp | 52× | Illumina HiSeq | 100 bp | | SRR823377 | [23] |