Skip to main content

Table 3 Performance of assignment algorithms as a function of choice of correct answer

From: Identifying structural domains of proteins using clustering

Algorithm1

Correct answer

1-domain

2-domain

3-domain

4-domain

Overall2

SS

SCOP

75%

60%

46%

34%

70%

SS

CATH

83%

67%

47%

34%

74%

SS

SCOP or CATH3

83%

75%

64%

55%

79%

SS

SCOP (given)4

99%

86%

71%

75%

95%

CA

SCOP

75%

58%

45%

32%

69%

CA

CATH

81%

67%

45%

41%

73%

CA

SCOP or CATH3

82%

74%

64%

60%

79%

CA

SCOP (given)4

100%

84%

81%

69%

95%

  1. All runs are with m=22Å and s=5Å, and with adjacency constraint enforced, on the ASTRAL30 data set. 1CA refers to the α-carbon based algorithm and SS the secondary structure based one. 2Total, regardless of number of domains. 3Where SCOP and CATH differ, the choice which matched closest to our assignment was chosen in these runs. 4The algorithms were forced to cut into the number of domains specified by SCOP for each structure.