BMC Bioinformatics

Table 1 Summary statistics for the Master data set. NDom: Number of domains. LenAln: Length of alignment. PID_a: Average pairwise percentage identity. PID_w: Percentage identity across all members of a family. S_c: The structural similarity score. PSCR: Percentage of positions in a structurally conserved region.

From: OXBench: A benchmark for evaluation of protein multiple sequence alignment accuracy

	Min	Max	Mean	Median
NDom	2	122	5.7	3
LenAln	42	598	157.6	129
PID_a	5.1	98	53.6	52.2
PID_w	0.0	98.9	39.3	32.4
S_c	2.6	10.0	8.1	8.5
PSCR	2.5	100.0	74.5	81.0

Back to article page

ISSN: 1471-2105

Contact us

General enquiries: journalsubmissions@springernature.com