
Table 2 Performance of the SpeedGene algorithm on two real datasets

From: Handling the data management needs of high-throughput sequencing data: SpeedGene, a compression algorithm for the efficient storage of genetic data

| Dataset | Size | PLINK | Gzip | SpeedGene | Avg MAF |
| --- | --- | --- | --- | --- | --- |
| FHS | 8.822 GB | 564.6 MB | 1.400 GB | 460 MB | 0.238637 |
| COPDgene | 161 MB | 10.1 MB | 20.5 MB | 3.6 MB | 0.057327 |

  1. File sizes of the FHS dataset and COPDgene dataset, compressed using PLINK, SpeedGene and Gzip.
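
The file sizes reported in Table 2 can be turned into compression ratios (original size divided by compressed size). The following is a minimal sketch, not part of the original article. It assumes decimal units (1 GB = 1000 MB), which the source does not specify, and the names `TABLE_2`, `to_mb`, and `compression_ratios` are hypothetical helpers introduced only for illustration.

```python
# Sizes transcribed from Table 2 as (value, unit) pairs.
# ASSUMPTION: decimal units (1 GB = 1000 MB); the source does not say
# whether binary or decimal units were used.
MB_PER_UNIT = {"MB": 1.0, "GB": 1000.0}

TABLE_2 = {
    "FHS": {
        "Size": (8.822, "GB"),
        "PLINK": (564.6, "MB"),
        "Gzip": (1.400, "GB"),
        "SpeedGene": (460.0, "MB"),
    },
    "COPDgene": {
        "Size": (161.0, "MB"),
        "PLINK": (10.1, "MB"),
        "Gzip": (20.5, "MB"),
        "SpeedGene": (3.6, "MB"),
    },
}


def to_mb(value, unit):
    """Convert a (value, unit) pair to megabytes."""
    return value * MB_PER_UNIT[unit]


def compression_ratios(row):
    """Return original size / compressed size for each method in a row."""
    original_mb = to_mb(*row["Size"])
    return {
        method: original_mb / to_mb(*size)
        for method, size in row.items()
        if method != "Size"
    }


if __name__ == "__main__":
    for dataset, row in TABLE_2.items():
        for method, ratio in compression_ratios(row).items():
            print(f"{dataset:9s} {method:10s} {ratio:6.2f}x")
```

Whichever unit convention is assumed, the ordering within each row does not change: SpeedGene produces the smallest file for both datasets, followed by PLINK and then Gzip.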