Skip to main content

Table 3 Real-world FASTQ data sets used for performance evaluation

From: Light-weight reference-based compression of FASTQ data

Data

Species

Read Length

Number of Reads

Size (GB)

Reference

ERR231645

E. coli

51

6,344,039

1.41

NC_000913

ERR005143

P.syringae

2*72

3,551,133

0.89

NC_007005

SRR352384

S. cerevisiae

2*76

26,030,832

9.88

NC_001136.10

SRR801793

L. pneumophila

2*100

5,406,461

2.75

NC_018140

SRR554369

Pseudomonas

2*200

1,657,871

0.82

KI517354

ERR654984

E. coli

64-502

1,167,295

1.21

NC_000913

ERR233152

P. aeruginosa

77

2,745,192

0.72

AP014622

SRR327342

S. cerevisiae

138

15,036,699

5.74

ACFL01000033