Table 10. Comparison of reduced alphabets in terms of the ratio of high CN in the dataset, by AA type.

From: Automated Alphabet Reduction for Protein Datasets

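| Amino Acid | High CN ratio | DualRMI | WW5 | SR5 | MU4 | MM5 |
|---|---|---|---|---|---|---|
| K | 7.0% | 1 | 1 | 1 | 1 | 1 |
| E | 9.8% | 1 | 2 | 1 | 1 | 1 |
| D | 13.4% | 1 | 2 | 2 | 1 | 1 |
| Q | 14.9% | 1 | 1 | 1 | 1 | 1 |
| R | 15.1% | 1 | 1 | 1 | 1 | 1 |
| N | 18.6% | 1 | 1 | 2 | 1 | 1 |
| P | 20.6% | 1 | 3 | 3 | 2 | 1 |
| S | 25.3% | 2 | 1 | 1 | 2 | 1 |
| T | 26.3% | 2 | 4 | 1 | 2 | 1 |
| H | 27.6% | 2 | 4 | 1 | 1 | 2 |
| G | 30.2% | 2 | 3 | 4 | 2 | 3 |
| Y | 38.0% | 3 | 5 | 5 | 3 | 4 |
| W | 40.8% | 3 | 5 | 5 | 3 | 4 |
| A | 41.1% | 4 | 4 | 1 | 2 | 3 |
| M | 43.4% | 4 | 5 | 5 | 4 | 4 |
| L | 44.8% | 3 | 5 | 5 | 4 | 4 |
| F | 45.8% | 5 | 5 | 5 | 3 | 4 |
| V | 49.2% | 5 | 5 | 5 | 4 | 4 |
| I | 50.9% | 5 | 5 | 5 | 4 | 4 |
| C | 53.5% | 5 | 5 | 5 | 4 | 5 |
| Trans. | -- | 5 | 9 | 9 | 8 | 6 |
| Ave. range | -- | 8.7% | 14.0% | 12.6% | 16.8% | 10.2% |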

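Trans. = number of transitions between groups, i.e. the number of times the group label changes between consecutive rows (AAs are sorted by increasing High CN ratio). Ave. range = average range of the reduction groups, where the range of a group is the difference between the maximum and minimum High CN ratio of its AAs.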
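The two summary rows can be sketched in a few lines of Python. This is only an illustration of the footnote's definitions, not the authors' code: the names `count_transitions`, `average_range`, `HIGH_CN` and `GROUPS` are hypothetical, and the data are the rounded values transcribed from the table, in its row order.

```python
# Amino acids in the table's row order (ascending High CN ratio).
AMINO_ACIDS = "KEDQRNPSTHGYWAMLFVIC"

# High CN ratio (%) per amino acid, same order as AMINO_ACIDS.
HIGH_CN = [7.0, 9.8, 13.4, 14.9, 15.1, 18.6, 20.6, 25.3, 26.3, 27.6,
           30.2, 38.0, 40.8, 41.1, 43.4, 44.8, 45.8, 49.2, 50.9, 53.5]

# Group label assigned to each amino acid by each reduced alphabet,
# again in the table's row order.
GROUPS = {
    "DualRMI": [1, 1, 1, 1, 1, 1, 1, 2, 2, 2, 2, 3, 3, 4, 4, 3, 5, 5, 5, 5],
    "WW5":     [1, 2, 2, 1, 1, 1, 3, 1, 4, 4, 3, 5, 5, 4, 5, 5, 5, 5, 5, 5],
    "SR5":     [1, 1, 2, 1, 1, 2, 3, 1, 1, 1, 4, 5, 5, 1, 5, 5, 5, 5, 5, 5],
    "MU4":     [1, 1, 1, 1, 1, 1, 2, 2, 2, 1, 2, 3, 3, 2, 4, 4, 3, 4, 4, 4],
    "MM5":     [1, 1, 1, 1, 1, 1, 1, 1, 1, 2, 3, 4, 4, 3, 4, 4, 4, 4, 4, 5],
}


def count_transitions(labels):
    """Count adjacent rows whose group label differs."""
    return sum(a != b for a, b in zip(labels, labels[1:]))


def average_range(labels, ratios):
    """Mean over groups of (max ratio - min ratio) within each group."""
    by_group = {}
    for label, ratio in zip(labels, ratios):
        by_group.setdefault(label, []).append(ratio)
    ranges = [max(values) - min(values) for values in by_group.values()]
    return sum(ranges) / len(ranges)


for name, labels in GROUPS.items():
    print(name, count_transitions(labels), average_range(labels, HIGH_CN))
```

With the table's group labels, `count_transitions` gives 5, 9, 9, 8 and 6 for DualRMI, WW5, SR5, MU4 and MM5, matching the Trans. row. The table does not state the exact inputs behind the Ave. range row, so `average_range` is only a literal reading of the footnote applied to the rounded ratios and is not expected to reproduce those percentages.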