From: A comprehensive assessment of N-terminal signal peptides prediction methods
 | Dataset for Experiment #1: Zhang and Henzel [20] (experimentally verified SPs) | Dataset for Experiment #2: SPdb 5.1 [33] (derived from Swiss-Prot Release 55.0) | Dataset for Experiment #3: UniProtKB/Swiss-Prot Release 57.0 (excludes the datasets used in Experiments #1 and #2) |
---|---|---|---|
Positive | 270 human secreted recombinant proteins | 2,349 secretory proteins consisting of: | 228 secretory proteins consisting of: |
 |  | - Euk: 1874 | - Euk: 199 |
 |  | - Gpos: 168 | - Gpos: 17 |
 |  | - Gneg: 307 | - Gneg: 12 |
Negative | 270 human non-secretory proteins extracted from the SigHMM [26] dataset, which is in turn derived from Swiss-Prot Release 40.0 | 2,349 non-secretory proteins | 228 non-secretory proteins |
 |  | - Euk: 1874 (Cytoplasmic and nuclear)1 | - Euk: 199 (Cytoplasmic and nuclear)4 |
 |  | - Gpos: 168 (all cytoplasmic)2 | - Gpos: 17 (all cytoplasmic)5 |
 |  | - Gneg: 307 (all cytoplasmic)3 | - Gneg: 12 (all cytoplasmic)6 |
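
The per-group counts in the table can be checked against the stated totals. The following is a minimal Python sketch of that arithmetic. The dictionary layout, the key names, and the helper `total_per_class` are illustrative assumptions, not part of the original study. It relies on the table's indication that each negative set has the same size and organism-group breakdown as its positive set.

```python
# Dataset composition transcribed from the table above.
# The structure and names here are hypothetical, chosen only for illustration.
DATASETS = {
    "experiment_1": {"Human": 270},
    "experiment_2": {"Euk": 1874, "Gpos": 168, "Gneg": 307},
    "experiment_3": {"Euk": 199, "Gpos": 17, "Gneg": 12},
}


def total_per_class(name):
    """Sum the per-group counts for one class (positive or negative)."""
    return sum(DATASETS[name].values())


for name in DATASETS:
    n = total_per_class(name)
    # Positive and negative sets are the same size in every experiment.
    print(f"{name}: {n} positive + {n} negative = {2 * n} sequences")
```

The sums reproduce the totals given in the table: 2,349 per class for Experiment #2 and 228 per class for Experiment #3.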