Skip to main content

Table 1 Summary statistics for the Master data set. NDom: Number of domains. LenAln: Length of alignment. PID a : Average pairwise percentage identity. PID w : Percentage identity across all members of a family. S c : The structural similarity score. PSCR: Percentage of positions in a structurally conserved region.

From: OXBench: A benchmark for evaluation of protein multiple sequence alignment accuracy

 

Min

Max

Mean

Median

NDom

2

122

5.7

3

LenAln

42

598

157.6

129

PID a

5.1

98

53.6

52.2

PID w

0.0

98.9

39.3

32.4

S c

2.6

10.0

8.1

8.5

PSCR

2.5

100.0

74.5

81.0