BMC Bioinformatics

Table 12 Performance of BioHEL on learning CN and RSA using the alphabet optimized for the other dataset.

From: Automated Alphabet Reduction for Protein Datasets

Alphabet	% Acc. on CN dataset	% Acc. on RSA dataset
AA	74.0 ± 0.6	70.7 ± 0.4
DualRMI	73.3 ± 0.5	70.3 ± 0.4
DualRMI-alt	73.3 ± 0.7	69.53 ± 0.7•

A • marks reduced datasets where BioHEL obtains a performance significantly worse than the AA type dataset according to the statistical t-tests with a 99% confidence level. No significant differences were detected between the reduced alphabets.

Back to article page

ISSN: 1471-2105

Contact us

General enquiries: journalsubmissions@springernature.com