Skip to main content

Table 1 Statistics of the datasets that are used in this study

From: SigUNet: signal peptide recognition based on semantic segmentation

Organism Signal Peptides Transmembrane Cytosolic or Nuclear Total
Train Comp Train Comp Train Comp
SignalP
 Eukaryotes 1640 606 987 939 5133 1000 7760
 Gram-positive 208 48 117 117 360 213 685
 Gram-negative 423 104 523 523 912 260 1858
SPDS17
 Eukaryotes 46 323 689 1058
 Gram-positive 9 189 240 438
 Gram-negative 23 89 99 211
  1. The SignalP dataset is from the UniProtKB/Swiss-Prot in accordance with the identity list in Pertersen et al.’s study [12]; The SPDS17 dataset is from the UniProtKB/Swiss-Prot in accordance with the identity list in Savojardo et al.’s study [6].