Skip to main content

Table 1 Summary statistics for the Master data set. NDom: Number of domains. LenAln: Length of alignment. PID a : Average pairwise percentage identity. PID w : Percentage identity across all members of a family. S c : The structural similarity score. PSCR: Percentage of positions in a structurally conserved region.

From: OXBench: A benchmark for evaluation of protein multiple sequence alignment accuracy

  Min Max Mean Median
NDom 2 122 5.7 3
LenAln 42 598 157.6 129
PID a 5.1 98 53.6 52.2
PID w 0.0 98.9 39.3 32.4
S c 2.6 10.0 8.1 8.5
PSCR 2.5 100.0 74.5 81.0