Skip to main content

Table 8 Datasets Compressibility-SPRING: Type 1 datasets: Space savings (in percentage) of SPRING when processing files of increasing size (on rows)

From: FASTA/Q data compressors for MapReduce-Hadoop genomics: space and time savings made easy

Dataset

FASTA (%)

FASTQ (%)

16GB

90

86

32GB

91

86

64GB

93

86

96GB

94

86

  1. Let F be an input genomic file and \(F'\) be the same file compressed using SPRING as stand-alone application, the space saving is computed as \(1 - \frac{\text {(size of F' in bytes)}}{\text {(size of F in bytes)}}\)