Skip to main content

Table 8 Number of positive training and test samples for PROSITE patterns involved in Zinc finger meta-pattern

From: A stochastic context free grammar based framework for analysis of protein sequences

 

Number of known instances of the pattern

Number of known sequences containing the pattern

 

Total

Training set

Total

Positive test set

PS00028

10129 (88%)

9 (45%)

1598 (54%)

14 (14%)

PS00518

1092 (9%)

1 (5%)

1089 (37%)

32 (31%)

PS00752

7 (<<1%)

1 (5%)

7 (<<1%)

0 (0%)

PS01030

28 (<<1%)

4 (20%)

28 (1%)

3 (3%)

PS01102

20 (<<1%)

1 (5%)

20 (1%)

3 (3%)

PS01300

169 (1%)

3 (15%)

169 (6%)

22 (22%)

PS01358

117 (1%)

1 (5%)

73 (2%)

28 (27%)

Total

11562 (100%)

20 (100%)

2984* (100%)

102* (100%)

  1. *Some sequences contain instances of more than one pattern from the set.