Skip to main content

Table 8 Datasets Compressibility-SPRING: Type 1 datasets: Space savings (in percentage) of SPRING when processing files of increasing size (on rows)

From: FASTA/Q data compressors for MapReduce-Hadoop genomics: space and time savings made easy

Dataset FASTA (%) FASTQ (%)
16GB 90 86
32GB 91 86
64GB 93 86
96GB 94 86
  1. Let F be an input genomic file and \(F'\) be the same file compressed using SPRING as stand-alone application, the space saving is computed as \(1 - \frac{\text {(size of F' in bytes)}}{\text {(size of F in bytes)}}\)