Skip to main content

Table 8 Compression ratio

From: Compression-based classification of biological sequences and structures via the Universal Similarity Metric: experimental assessment

  CK-36-PDB CK-36-REL CK-36-SEQ SP-86-ATOM SP-86-PDB AA-15-DNA
Gzip 6.434 13.173 21.271 2.133 6.214 2.415
Bzip2 6.696 16.104 26.808 1.503 6.522 2.197
PPMd16 6.846 13.303 22.996 1.463 6.641 2.108
PPMd8 6.846 13.303 22.996 1.422 6.641 2.109
PPMd4 6.846 13.316 23.043 1.565 6.641 2.047
PPMd2 6.828 13.310 22.996 2.045 6.627 1.966
Huffman 6.307 14.356 25.867 3.590 6.095 2.152
Ac fast 6.991 16.903 28.847 3.573 6.711 1.951
Rc fast 6.952 17.170 30.337 3.581 6.679 1.955
Ac med. 6.650 15.682 27.718 3.533 6.411 1.951
Rc med. 6.631 15.929 29.161 3.536 6.391 1.956
Ac slow 6.350 14.551 26.745 3.542 6.140 1.955
Rc slow 6.364 14.954 28.298 3.545 6.153 1.958
BwtRleHuff 6.580 15.402 28.298 1.849 6.355 2.271
BwtMtfRleHuff 6.589 15.064 27.796 1.644 6.313 2.138
BwtRleAc fast 7.300 18.002 31.592 1.557 6.989 2.141
BwtMtfRleAc fast 7.393 17.345 30.933 1.491 7.033 2.037
BwtRleRc fast 7.266 18.281 33.224 1.558 6.962 2.145
BwtMtfRleRc fast 7.360 17.534 32.376 1.493 7.007 2.041
BwtRleRc med. 6.932 16.981 31.922 1.677 6.667 2.142
BwtMtfRleRc med. 7.025 16.455 31.294 1.561 6.715 2.035
BwtRleRc slow 6.654 15.961 30.824 1.793 6.415 2.142
BwtMtfRleRc slow 6.755 15.669 30.478 1.614 6.473 2.034
BwtWavelet 6.913 15.019 27.686 1.607 6.734 2.188
Gencompress - - - - - 1.933
  1. Average compression ratio, in bits per symbol, of the tested algorithms for the six data sets.