
Table 5 SP-86-ATOM

From: Compression-based classification of biological sequences and structures via the Universal Similarity Metric: experimental assessment

| SP-86-ATOM | UCD UPGMA | UCD NJ | NCD UPGMA | NCD NJ | CD UPGMA | CD NJ |
|---|---|---|---|---|---|---|
| Gzip | 0.5349 | 0.5337 | 0.5376 | 0.5328 | 0.5338 | 0.5396 |
| Bzip2 | 0.5779 | 0.5510 | 0.5779 | 0.5510 | 0.5632 | 0.5472 |
| PPMd16 | 0.5425 | 0.5460 | 0.5361 | 0.5265 | 0.5567 | 0.5265 |
| PPMd8 | 0.5550 | 0.5550 | 0.5443 | 0.5569 | 0.5562 | 0.5550 |
| PPMd4 | 0.5454 | 0.5459 | 0.5394 | 0.5571 | 0.5482 | 0.5472 |
| PPMd2 | 0.5348 | 0.5297 | 0.5412 | 0.5265 | 0.5418 | 0.5278 |
| Huffman | 0.5303 | 0.5265 | 0.5303 | 0.5265 | 0.5331 | 0.5365 |
| Ac fast | 0.5553 | 0.5385 | 0.5625 | 0.5524 | 0.5645 | 0.5413 |
| Rc fast | 0.5587 | 0.5477 | 0.5626 | 0.5389 | 0.5602 | 0.5472 |
| Ac med. | 0.5580 | 0.5493 | 0.5563 | 0.5438 | 0.5627 | 0.5272 |
| Rc med. | 0.5581 | 0.5583 | 0.5510 | 0.5434 | 0.5534 | 0.5492 |
| Ac slow | 0.5440 | 0.5314 | 0.5410 | 0.5265 | 0.5471 | 0.5463 |
| Rc slow | 0.5363 | 0.5265 | 0.5376 | 0.5265 | 0.5489 | 0.5328 |
| BwtRleHuff | 0.5408 | 0.5390 | 0.5408 | 0.5332 | 0.5557 | 0.5509 |
| BwtMtfRleHuff | 0.5411 | 0.5438 | 0.5411 | 0.5438 | 0.5420 | 0.5567 |
| BwtRleAc fast | 0.5365 | 0.5282 | 0.5365 | 0.5282 | 0.5323 | 0.5363 |
| BwtMtfRleAc fast | 0.5775 | 0.5421 | 0.5775 | 0.5421 | 0.5558 | 0.5747 |
| BwtRleRc fast | 0.5317 | 0.5362 | 0.5365 | 0.5462 | 0.5397 | 0.5265 |
| BwtMtfRleRc fast | 0.5791 | 0.5421 | 0.5791 | 0.5609 | 0.5558 | 0.5550 |
| BwtRleRc med. | 0.5338 | 0.5265 | 0.5338 | 0.5284 | 0.5340 | 0.5317 |
| BwtMtfRleRc med. | 0.5390 | 0.5550 | 0.5390 | 0.5550 | 0.5495 | 0.5405 |
| BwtRleRc slow | 0.5350 | 0.5385 | 0.5385 | 0.5419 | 0.5415 | 0.5415 |
| BwtMtfRleRc slow | 0.5338 | 0.5354 | 0.5338 | 0.5354 | 0.5420 | 0.5694 |
| BwtWavelet | 0.5362 | 0.5344 | 0.5362 | 0.5368 | 0.5339 | 0.5265 |

Experimental results for the SP-86-ATOM data set with the UCD (left), NCD (middle), and CD (right) distances. For each compression algorithm, we report the F-measure for both the UPGMA and NJ clustering methods.
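
As a rough illustration of how one of the distances in the table is obtained, the sketch below computes the NCD in its standard form, NCD(x, y) = (C(xy) - min(C(x), C(y))) / max(C(x), C(y)), where C(.) is the compressed size in bytes. This is not the authors' implementation. The function names `compressed_size` and `ncd` are hypothetical, and Python's `zlib` (DEFLATE, the algorithm used by gzip) and `bz2` modules are assumed here as stand-ins for the Gzip and Bzip2 rows.

```python
import bz2
import zlib


def compressed_size(data: bytes, compressor=zlib.compress) -> int:
    """Return the length in bytes of `data` after compression."""
    return len(compressor(data))


def ncd(x: bytes, y: bytes, compressor=zlib.compress) -> float:
    """Normalized Compression Distance between two byte strings.

    Standard formula: (C(xy) - min(C(x), C(y))) / max(C(x), C(y)).
    `compressor` is any bytes -> bytes function, e.g. zlib.compress
    or bz2.compress (assumed stand-ins, not the paper's exact tools).
    """
    cx = compressed_size(x, compressor)
    cy = compressed_size(y, compressor)
    cxy = compressed_size(x + y, compressor)
    return (cxy - min(cx, cy)) / max(cx, cy)


if __name__ == "__main__":
    # Toy inputs for illustration only; these are not SP-86-ATOM entries.
    a = b"MKTAYIAKQRQISFVKSHFSRQLEERLGLIEVQAPILSRVGDGTQDNLSGAEKAVQ" * 4
    b = b"GSHMLEDPVDAFKQKLRDLNTLLEEAKRLSEEGNHEEALKLLKEAIELDPN" * 4
    for name, comp in (("zlib", zlib.compress), ("bz2", bz2.compress)):
        print(name, round(ncd(a, b, comp), 4))
```

A full pairwise matrix of such values would then be passed to a clustering method such as UPGMA or NJ; that step is not shown here.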