Skip to main content

Table 10 Comparison of reduced alphabets in terms of the ratio of high CN in the dataset by AA type.

From: Automated Alphabet Reduction for Protein Datasets

Amino Acid High CN ratio DualRMI WW5 SR5 MU4 MM5
K 7.0% 1 1 1 1 1
E 9.8% 1 2 1 1 1
D 13.4% 1 2 2 1 1
Q 14.9% 1 1 1 1 1
R 15.1% 1 1 1 1 1
N 18.6% 1 1 2 1 1
P 20.6% 1 3 3 2 1
S 25.3% 2 1 1 2 1
T 26.3% 2 4 1 2 1
H 27.6% 2 4 1 1 2
G 30.2% 2 3 4 2 3
Y 38.0% 3 5 5 3 4
W 40.8% 3 5 5 3 4
A 41.1% 4 4 1 2 3
M 43.4% 4 5 5 4 4
L 44.8% 3 5 5 4 4
F 45.8% 5 5 5 3 4
V 49.2% 5 5 5 4 4
I 50.9% 5 5 5 4 4
C 53.5% 5 5 5 4 5
Trans. -- 5 9 9 8 6
Ave. range -- 8.7% 14.0% 12.6% 16.8% 10.2%
  1. Trans. = number of transitions between groups. Ave. range = average range of each reduction group, range is the difference between the maximum and minimum High CN ratio of the AAs of a group.