Skip to main content

Table 1 Detailed descriptions of tested genome datasets

From: CMIC: an efficient quality score compressor with random access functionality

Code

Datasets

Platforms

Organism

Bases (Mbp)

Read length

Size (Quality Score)

1

SRR1284073

PacBio

Escherichia coli

649.4

(130,10,000)

476,930,701 bytes

2

SRR327342

Illumina

S.Cerevisiae

2100

75

2,105,137,860 bytes

3

SRR870667

Illumina

T.Cacao

12,600

74 or 108

11,455,676,056 bytes

4

ERR091571

Illumina

Homo sapiens

42,700

100

43,133,335,476 bytes

5

SRR003187

LS454

Homo sapiens

803

(500,1000)

798,985,944 bytes

6

SRR003177

LS454

Homo sapiens

855

(500,1000)

850,464,554 bytes

7

SRR007215

ABI Solid

Homo sapiens

238.6

25

248,099,332 bytes

8

SRR010712

ABI Solid

Homo sapiens

431.6

35

443,972,736 bytes

9

SRR070253

ABI Solid

Homo sapiens

45,600

50

12,719,021,580 bytes

10

SRR801793

Illumina

Legionella pneumophila

1100

100

1,092,105,122 bytes

11

SRR14340293

OXFORD NANOPORE

Puccinia graminis

8900

(1000,10,000)

7,782,970,748 bytes