Skip to main content

Table 7 Experiment 4-File size overhead: Type 2 datasets FASTQ files: The Table Legend is as in Table 5

From: FASTA/Q data compressors for MapReduce-Hadoop genomics: space and time savings made easy

Dataset BZIP2 LZ4 ZSTD DSRC DSRC Fqzcomp SPRING
SA vs HS SA vs HS SA vs HS SA vs HS SA vs HU SA vs HU SA vs HU
H. Sapiens 1 (cov. 1.6x) \(\sim 0\%\) \(\sim 0\%\) \(\sim 0\%\) \(\sim 0\%\) \(\sim 0\%\) \(\sim 4.76\%\) \(\sim 10.00\%\)
H. Sapiens 2 (cov. 14.4x) \(\sim 0\%\) \(\sim 0\%\) \(\sim 0\%\) \(\sim 0\%\) \(\sim 0\%\) \(\sim 1.41\%\) \(\sim 43.71\%\)
H. Sapiens 3 (cov. 26.6x) \(\sim 0\%\) \(\sim 0\%\) \(\sim 0\%\) \(\sim 0\%\) \(\sim 0\%\) \(\sim 1.41\%\) \(\sim 154.43\%\)